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FIGURE 6B 

AMINO ACID DISTRIBUTION 
OF AIDS ENV PROTEIN 
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FIGURE 6 A, 



METArg 

VdTLysGluLysTyrGlnHisLeuTrpArgTrpGlyTrpArgTrpGlyThrMETLeuLeuG1yf€TLeu 
METneCysSerAUThrGluLysLeuTrpValThrValTyrTyrGlyValProValTrpLysGluAla 
ThrThrThrLeuPheCysAlaSerAspAlaLysAlaTyrAspThrGluValHisAsnValTrpAlaThr 
HisAl aCysVa 1 ProThr AspProAsnProG I nG 1 uVal Va I LeuVal AsnVa 1 ThrG 1 uAsnPneAsn 
HETTrpLysAsnAspMETVal Gl uG 1 nHETH i sGl uAspIl elleSerLeuTrpAspGl nSerleuLys 
ProCysValLysLeuThrProLeuCysValSerLeuLysCysThrAspLeuLysAsnAspThrAsnThr 
AsnSerSerSerGlyArgMETnertETGIuLysGlyGluIleLysAsnCysSerPheAsnneSerThr 
SGrlleArgGlyLysValGlnLysGluTyrAlaPhePheTyrLysLeuAspnelleProIleAspAsn 
AspThrThrSerTyrThrLeuThrSerCysAsnThrSerVallleThrGlnAlaCysProLysValSer 
PheGl uProIleProIl eHi sTyrCysAl aProAl aGl yPheAlal 1 eLeuLysCysAsnAsnlysThr 
PheAsnG 1 y ThrG 1 yProCysThrAsnVa 1 SerThrVa 1 Gl nCysThrH i sG 1 y II eArgProVa 1 Va 1 
SerThrGlnLeuleuLeuAsnGlySerLeuAlaGluGluGluValVallleArgSerValAsnPheThr 
AspAsnAl aLysThr II el 1 eVal Gl n LeuAsnThrSerValGl ul 1 eAsnCysThrArgProAsnAsn 
AsnThrArgLysLysIleArglleGlnArgGlyProGlyArgAlaPheValThrlleGlyLysneGly 
AsnMETArgGlnAlaffisCysAsnlleSerArgAULysTrpAsnAlaThrLeuLysGlnlleAlaSer 
LysleuArgGl uG 1 nPheG 1 yAsnAsn LysThrl 1 ell ePheLysG 1 nSerSerGl yGl yAspProGl u 
I 1 eVa I ThrH i sSerPheAsnCysGl yGl y<31 uPhePheTyrCysAsnSerThrGl nLeuPheAsnSer 
ThrTrpPheAsnSerThrTrpSerThrGluGlySerAsnAsnThrGluGlySerAspThrlleThrLeu 
ProCysArg 1 1 elysG 1 nPhel 1 eAsnMET TrpGl nGluValGl yLysAl aMET TyrAl aProProI 1 e 
SerGlyGlnlleArgCysSerSerAsnlleThrGlyLeuLeuLeuThrArgAspGlyGlyAsnAsnAsn 
AsnGlySerGluIlePheArgProGlyGlyGlyAspMETArgAspAsnTrpArgSerCluleuTyrLys 
TyrLysVal Val Lysl 1 eGl uProleuGl yVal Al aProThrLysAl aLysArgArgVal Val Gl nArg 
Gl uLysArgAl aVal Gl yll eGl yAl aLeuPheleuGl yPheLeuGl yAl aAl aGl ySerThrMETGl y 
AlaAlaSerWETThrLeuThrValGlnAlaArgGlnLeuLeuSerGlylleValGlnGlnGlnAsnAsn 
LeuletiArgAl al 1 eGl uAl aGl nGl nHi steuLeuGI nLeuThrVal TrpGl y I leLysGl nLeuGl n 
AlaArglleLeuAlaValGluArgTyrLeuLysAspGlnGlnLeuLeuGlylleTrpGlyCysSerGly 
LysLeuIleCysThrThrAlaValProTrpAsnAlaSerTrpSerAsnLysSerLeuGluGlnlleTrp 
AsnH i sTh rThrTrpMETGl uTrpAspArgGl u 1 1 eAsnAsnTyrThrSer Leul 1 eH i sSerLeuI 1 e 
G 1 uG 1 uSerGl nAsnG 1 nG 1 nG 1 uLysAsnGluG 1 nGl uLeuLeuGl uLeuAspLy sTrpAl aSerLeu 
TrpAsnTrpPheAsnll eThrAsnTrpLeu T rpTyrl leLysLeuPhel 1 eMET II eVal Gl yG 1 yleu 
Va 1 G 1 yLeuArgl 1 eVa 1 PheAl aVal LeuSerVa 1 Va 1 AsnArgVa 1 ArgG 1 nG 1 yTy rSerProLeu 
SerPheGlnThrHisLeuProIleProArgGlyProAspArgProGluGlylleGluGluGluGlyGly 
G 1 uArgAspArgAspArgSerl 1 eArgteuVa 1 AsnGlySerLeuAl aLeu I 1 eTrpAspAspLeaArg 
SerLeuCysLeuPheSerTyrHisArgLeuArgAspLeuLeuLeuIleValThrArglleValGluLeu 
LeuGl yArgArgGl yTrpGl uAl aLeuLysTyrTrpTrpAsnLeuLeuGl nTyrTrpSerGl nGl uLeu 
LysAsnSerAl aVal SerLeuLeuAsnA 1 aThrA 1 al 1 eAl aVal Al aGl uGl yThrAspArgVa 1 lie 
G 1 uVa 1 Va 1 G 1 nGl uA 1 aTyrArgAl all eArgH i s I 1 eProArgArgll eArgGl nGl y LeuG 1 uArg 
IleLeuLeu 
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Figure 5 
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Figure 4 
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601 650 

HXB-3 KL ICTTAVPWNASWSNKSLEQIWNHTTWMEWDRE INNYTSL IHSL IE ESQ 
BH-LO MM 
BH-8 NM 
LAV NM 

ARV-2 D DNM Q E D NT YT 



651 700 

HXB-3 NQQEKNEQELLELDKWASLWNWFNITNWLWYI KLF IMI VGGLVGLR I VFA 

BH-10 

BH-8 

LAV X 
ARV-2 S I 



701 75C 

HXB-3 VLSWNRVRQGYSPLSFQTHLP IPRGPDRPEG IEEEGGERDRDRS I RLVN 
BH-10 

BH-8 I n 

LAV I t 

ARV-2 I R V D V D 



751 



300 



HXB-3 GSLAL IWDDLRSLCLFS YHRLRDLLL I VTR I VELLGRRGWEALKYWWNLL 

BH-10 

BH-8 

LAV 

ARV-2 F E R AA T : H 3 



801 



850 



HXB-3 QYWSQELKNSAVSLLNATAIAVAEGTDRVI EWQEAYRAIRHIPRR I RQG 
BH-10 G 

BH-8 N LA 

LAV q c 

ARV-2 I W T A R L H 

851 856 



HXB-3 LERILL 

BH-10 

BH-8 

LAV 

ARV-2 L 



11 - " designates a deletion of one amino acid. An empty soace 
denotes identity with HXB-3 sequence. 
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301 350 

HXB-3 NNTRKK I R IQRGPGRAFVTIGKIGNMRQ-AHCN I SRAKWNATLKQI AS KLR 
BH-10 S N D 

BH-8 D 
LAV S 

ARV-2 S Y - - H T R IGDIRK Q N E VK 



351 400 

HXB-3 EQFGNNKT I IFKQSSGGDPE IVTHSFNCGGEFFYCNSTQLFNSTWFNSTW 

BH-10 

BH-8 

LAV 

ARV-2 V N MR T N -RLNH 



401 450 

HXB-3 STEGSNNTEGSDTITLPCRIKQF INMWQEVGKAMYAPP I SGQ IRCSSN IT 

BH-10 K I 

BH-8 K I 
LAV 

ARV-2 - K N I I GS 



451 500 

HXB-3 GLLLTRDGG-NNNNGSE I FRPGGGDMRDNWRSELYKYKVVK I EPLGVAPTK 
BH-10 - S E 

BH-8 - S E 

LAV 

ARV-2 T VT DT V II 



501 550 

HXB-3 AKRRWQREKRAVGI -GALFLGFLGAAGSTMGAASMTLTVQARQLLSG I VQ 

BH-10 

BH-8 

LAV - R 

ARV-2 V M V L 

551 600 

HXB-3 QQNNLLRA IEAQQHLLQLTVWG I KQLQAR I LAVERYLKDQQLLG I WGCSG 

BH-10 G 

BH-8 

LAV 

ARV-2 V R 
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FIGURE 2 (3 pages) 

1 50 

HXB-3 MRVKEK YQHLWRWGWRWGTMLLGMLMICS ATEKLWVTVYYGVPVWKEATT 

BH-10 

^ ^ : :..] BH-8 F 
LAV K I 
ARV-2 K --GTRRN 

51 LOO 

Jj^&Lrzit HXB-3 TLFCASDAKAYDTEVHNVWATHACVPTDPNPQEWLVNVTENFNMWKNDM 

BH-LO 
^ "/T BH-8 
LAV 

ARV-2 R G N 

101 150 

HXB-3 VEQMHEDI I SLWDQSLKPCVKLTPLCVSLKCTDLKNDTNTNSS SGRMIME 

BH-LO 
BH-8 

LAV G A NTNSS E M 
ARV-2 Q T N G A NWKEEI 

151 200 

HXB-3 KGE IKNCSFNISTS IRGKVQKE YAFFYKLD I IP IDND- -TTSYTLTS CNTSV 

BH-LO 

BH-8 K 

i . . LAV 

'W^r? ARV-2 T D I N L RN W AST N NYRLIH R 

20L 250 

HXB-3 ITQACPKVSFEP IP IHYCAPAGFAI LKCNNKTFNGTGPCTNVSTVQCTHG 

BH-LO 

BH-8 

LAV A 
ARV-2 T K 

25L 300 

HXB-3 IRPWSTQLLLNGSLAEEEWIRSVNFTDNAKTI I VQLNTSVE INCTRPN 
BH-LO A Q 

BH-8 D 
LAV A Q 

ARV-2 I D N E A 
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FIGURE 1 



1 ATTCTGCAACAACTGCTGTTTATCCATTTTCAGAATTGGGTGTCGACATAGCAGAATAGGCGTTA.CTCG 69 
70 ACAGAGGAGAGCAAGAAATGGAGCCAGTAGATCCTAGACTAGAGCCCTGGAAGCATCCAGGAAGTCAGC L38 
139 CTAAAACTGCTTGTACCAATTGCTATTGTAAAAAGTGTTGCTTTCATTGCCAAGTTTGTTTCATAACAA 207 
208 AAGCCTTAGGCATCTCCTATGGCAGGAAGAAGCGGAGACAGCGACGAAGACCTCCTCAAGGCAGTCAGA 276 
277 CTCATCAAGTTTCTCTATCAAAGCAGTAAGTAATACATGTAATGCAACCTATACAAATAGGAATAGTAG 345 
346 C ATT AGTAGTAGCAATAAT AAT AGC AATAGTTGTGTGGTCCATAGT AATCATAGAATATAGG AAAATAT 414 
415 TAAGACAAAGAAAAATAGACAGGTTAATTGATAGACTAATAGAAAGAGCAGAAGACAGTGGCAATGAGA 483 
484 GTGAAGGAGAAATATCAGCACTTGTGGAGATGGGGGTGGAGATGGGGCACCATGCTCCTTGGGATGTTG 552 
553 ATGATCTGTAGTGCTACAGAAAAATTGTGGGTCACAGTCTATTATGGGGTACCTGTGTGGAAGGAAGCA 621 
622 ACCACCACTCTATTTTGTGCATCAGATGCTAAAGCATATGATACAGAGGTACATAATGTTTGGGCCACA 690 
691 CATGCCTGTGTACCCACAGACCCCAACCCACAAGAAGTAGTATTGGTAAATGTGACAGAAAATTTTAAC 759 
760 ATGTGGAAAAATGACATGGTAGAACAGATGCATGaGGATATAATCAGTTTATGGGATCAAAGCCTAAAG 828 
829 CCATGTGTAAAATTAACCCCACTCTGTGTTAGTTTAAAGTGCACTGATTTGAAGAATGATACTAATACC 897 
898 AATAGTAGTAGCGGGAGAATGATAATGGAGAAAGGAGAGATAAAAAACTGCTCTTTCAATATCAGCACA 966 
967 AGCATAAGAGGTAAGGTGCAGAAAGAATATGCATTTTTTTATAAACTTGATATAATACCAATAGATAAT 1035 
1036 GATACTACCAGCTATACGTTGACAAGTTGTAACACCTCAGTCATTACACAGGCCTGTCCAAAGGTATCC 1104 
1105 TTTGAGCCAATTCCCATACATTATTGTGCCCCGGCTGGTTTTGCGATTCTAAAATGTAATAATAAGACG 1173 
1174 TTCAATGGAACAGGACCATGTACAAATGTCAGCACAGTACAATGTACACATGGAATTAGGCCAGTAGTA 1242 
1243 TCAACTCAACTGCTGTTAAATGGCAGTCTAGCAGAAGAAGAGGTAGTAATTAGATCTGTCAATTTCACG 1311 
1312 GACAATGCTAAAACCATAATAGTACAGCTGAACACATCTGTAGAAATTAATTGTACAAGACCCAACAAC 1380 
L381 AATACAAGAAAAAAAATeCGTATCCAGAGGGGACCAGGGAGAGCATTTGTTACAATAGGAAAAATAGGA 1449 
1450 AATATGAGACAAGCACATTGTAACATTAGTAGAGCAAAATGGAATGCCACTTTAAAACAGATAGCTAGC 1518 
1519 AAATTAAGAGAAC AATTTGGAAATAATAAAACAATAATCTTTAAGC AATCCTCAGGAGGGGACCCAGAA 1587 
1588 ATTGT AACGCAC AGTTTT AATTGTGG AGGGG AATTTTTCTACTGT AATTCAACACAACTGTTTAATAGT 1656 
1657 ACTTGGTTTAATAGTACTTGGAGTACTGAAGGGTCAAATAACACTGAAGGAAGTGACACAATCACACTC 1 725 
1726 CCATGCAGAATAAAACAATTTATAAACATGTGGCAGGAAGTAGGAAAAGCAATGTATGCCCCTCCCATC 1794 
1795 AGCGGACAAATTAGATGTTCATCAAATATTACAGGGCTGCTATTAACAAGAGATGGTGGTAATAACAAC 1863 
1864 AATGGGTCCGAGATCTTCAGACCTGGAGGAGGAGATATGAGGGACAATTGGAGAAGTGAATTATAT AAA 1932 
1933 TATAAAGTAGTAAAAATTGAACCATTAGGAGTAGCACCCACCAAGGCAAAGAGAAGAGTGGTGCAGAGA 2001 
2002 GAAAAAAGAGCAGTGGGAATAGGAGCTTTGTTCCTTGGGTTCTTGGGAGCAGCAGGAAGCACTATGGGC 2070 
2071 GCAGCGTCAATGACGCTGACGGTACAGGCCAGACAATTATTGTCTGGTATAGTGCAGCAGCAGAACAAT 2139 
2140 TTGCTGAGGGCTATTGAGGCGCAACAGCATCTGTTGCAACTCACAGTCTGGGGCATCAAGCAGCTCCAG 2208 
2209 GCAAGAATCCTGGCTGTGGAAAGATACCTAAAGGATCAACAGCTCCTGGGGATTTGGGGTTGCTCTGGA 2277 
2278 AAACTAATTTGCACCACTGCTGTGCCTTGGAATGCTAGTTGGAGTAATAAATCTCTGGAACAGATTTGG 2346 
2347 AATCACACGACGTGGATGGAGTGGGACAGAGAAATTAACAATTACACAAGCTTAATACACTCCTTAATT 2415 
2416 GAAGAATCGCAAAACCAGCAAGAAAAGAATGAACAAGAATTATTGGAATTAGATAAATGGGC AAGTTTG 2484 
2485 TGGAATTGGTTTAACATAACAAATTGGCTGTGGTATATAAAATTATTCATAATGATAGTAGGAGGCTTG 2553 
25 5 4 GTAGGTTTAAGAATAGTTTTTGCTGTACTTTCTGTAGTGAATAGAGTTAGGCAGGGATATTCACCATTA 2622 
2623 TCGTTTCAGACCCACCTCCCAATCCCGAGGGGACCCGACAGGCCCGAAGGAATAGAAGAAGAAGGTGGA 2691 
2692 GAGAG AGAC AGAGAC AG ATCCATTCG ATT AGTG AACGGATCCTTAGCACTTATCTGGGACGATCTGCGG 2760 
276 1 AGCCTGTGCCTCTTCAGCTACCACCGCTTGAGAGACTTACTCTTGATTGTAACGAGGATTGTGGAACTT 2829 
2830 CTGGGACGCAGGGGGTGGGAAGCCCTCAAATATTGGTGGAATCTCCTACAATATTGGAGTCAGGAGCTA 2898 
2899 AAGAATAGTGCTGTTAGCTTGCTCAATGCCACAGCTATAGCAGTAGCTGAGGGGACAGATAGGGTTATA 2967 
2968 GAAGTAGTACAAGAAGCTTATAGAGCTATTCGCCACATACCTAGAAGAATAAGACAGGGCTTGGAAAGG 3036 
3037 ATTTTGCTATAAGATGGGTGGCAAGTGGTCAAAAAGTAGTGTGGTTGGATGGCCTGCTGTAAGGGAAAG 3105 
3106 AATGAGACGAGCTGAGCCAGCAGCAGATGGGGTGGGAGCAGC ATCTCGAGA 3156 
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54. The use of a protein as claimed in anyone of 
ciaims 1-8 for the preparation of a protective im- 
munisation vaccine. 
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or an equivalent thereof. 

41. An expression vector according to claim 37 
wherein said gene coding for an envelope protein 
of an AIDS virus is a gene comprising the 5 
nucleotide sequence: 

ATGTATGCCCCTCCCATC 

AGCGGACAAATTAGATGTTCATCAAATATTACAGCGCTGCTATTAACAAGAGATGGTGGTAATAACAAC 
AATGGGTCCGAGATCTTCAGACCTGGAGGAGGAGATATGAGGGACAATTGGAGAAGTGAATTATATAAA 
TATAAAGTAGTAAAAATTGAACCATTAGGAGTAGCACCCACCAAGGCAAAGAGAAGAGTGGTGCAGAGA 
GAAAAAAGAGCAGTGGGAATAGGAGCTTTGTTCCTTGGGTTCTTGGGAGCAGCAGGAAGCACTATGGGC 
GCAGCGTCAATGACGCTGACGGTACAGGCCAGACAATTATTGTCTGGTATAGTGCAGCAGCAGAACAAT 
TTGCTGAGGGCTATTGAGGCGCAACAGCATCTGTTGCAACTCACAGTCTGGGGCATCAAGCAGCTCCAG 
GGAAGAATCCTGGCTGTGGAAAGATACCTAAAGGATCAACAGCTCCTGGGGATTTGGGGTTGCTCTGGA 
AAACTAATTTGCACCACTGCTGTGCCTTGGAATGCTAGTTGGAGTAATAAATCTCTGGAACAGATTTGG 
AATCACACGACGTGGATGGAGTGGGACAGAGAAATTAACAATTACACAAGC 



20 

or an equivalent thereof. 

42. An expression vector according to claim 37 
wherein said gene coding for an envelope protein 
of an AIDS virus is a gene comprising the 25 
nucleotide sequence: 

ATGAGGGACAATTGGAGAAGTGAATTATATAAA 
TATAAAGTAGTAAAAATTGAACCATTAGGAGTAGCACCCACCAAGGCAAAGAGAAGAGTGGTGCAGAGA 
GAAAAAAGAGCAGTGGGAATAGGAGCTTTGTTCCTTGGGTTCTTGGGAGCAGCAGGAAGCACTATGGGC 
GCAGCGTCAATGACGCTGACGGTACAGGCCAGACAATTATTGTCTGGTATAGTGCAGCAGCAGAACAAT 
TTGCTGAGGGCTATTGAGGCGCAACAGCATCTGTTGCAACTCACAGTCTGGGGCATCAAGCAGCTCCAG 
GCAAGAATCCTGGCTGTGGAAAGATACCTAAAGGATCAACAGCTCCTGGGGATTTGGGGTTGCTCTGGA 
AAACTAATTTGCACCACTGCTGTGCCTTGGAATGCTAGTTGGAGTAATAAATCTCTGGAACAGATTTGG 
AATCACACGACGTGGATGGAGTGGGACAGAGAAATTAACAATTACACAAGCT 



49. A transformant according to claim 48 which is 

43. An expression vector according to any one of an E. coli strain, 
claims 37 to 42 which is a plasmid capable of 

replication in gram-negative bacteria. 50. A transformant according to claim 49 which is 

an E. coli MC 1061 strain. 

44. An expression vector according to claim 43 

which is capable of replication in an E. coli strain. ^ 51. Antibodies raised against a protein obtained 

according to any one of claims 1 to 12 and 33 and 

45. An expression vector according to claim 43 or 34. 
44 which is a member of the pEV/env family. 

52. The antibodies of claim 51 which are mon- 

46. An e>pression vector according to claim 45 oclonal antibodies, 
which is pEVl , -2, or -3/env 44-640. 50 

53. A vaccine eliciting immunity to AIDS compns- 

47. An expression vector according to claim 45 ing as an active ingredient a protein obtained ac- 
which is pEV1, -2, or -3/env 205-640. cording to any one of claims 1 to 12 and 33 and 



48. A transformant carrying an expression vector as 
claimed in any one of claims 37-47. 



55 



34. 



31 



57 
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58 



or an equivalent thereof. 

39, An expression vector according to claim 37 
wherein said gene coding for an envelope protein 
of an AIDS virus is a gene comprising the 5 
nucleotide sequence: 

TGTCCAAAGGTATCC 

TTTGAGCCAATTCCCATACATTATTCTGCC^ 

TTCAATGGAACAGGACCATGTACAAATGTCAGCACAGTACAATGTACACATC 
TCAACTCAACTGCTGTTAAATGCK^AGTCTAG^ 
GACAATGCTAAAACCATAATAGTACAGCTGAACACA 
AATACAAGAAAAAAAATCCGTATCCAGAGGC<JACCAGGGAGAG<^TTT^ 

AATATGAGACAAGCACATTGTAACATTAGTAGAGCAAAATGGAATGCCACTTTAAiUlCAGATAGCrAGC 
AAATTAAGAGAACAATTTGGAAATAATAAAACAATAATCTTTAAC-CAAT^ 

ATTGT AACGCACAGTTTT AATTGTGG AGGGG AATTTTTCTACTGTAATTCAA ZT, C 7.ACTGTTTAATAGT 

ACTTGGTTTAATAGTACTTGGAGTACTGAAGGGTCAAATAACACTGAAGGAA- 3 ~ ~ A CACAATCACACTC 

CCATGCAGAATAAAACAATTTATAAACATGTGGCAGGAAGTAGGAAAAGCAA"; : A 7GCCCCTCCCATC 

AGCGG AC AAATT AG ATGTTC AT CAAAT ATT AC AGGGCTGCT^ 

AATCC^TCCGAGATCTTCAGACCTGGAGGAGGAGATATGAGGGACA^ 

TATAAAGTAGTAAAAATTGAACCATTAGGAGTAGCACCCACCAAGGCAAAGAGA^ 

GAAAAAAGAGCAGTGGGAATAGGAGCTTTGTTCCTTGGGTTCTTGGGA 

GCAGCGTCAATCACGCTGACGGTACAGGCCAGACAATTATTGTCTISGTAT^^ 

TTGCTGAGGGCTATTGAGGCGCAACAGCATCTGTTGCAAC 

GCAAGAATCCTGGCTGTGGAAAGATACCTAAAGGATCAACAGCTCCTGGGGA 

AAACTAATTTGCACCACTGCTGTGCCTTGGAATGCTAGTT^ 

AATCACACGACGTGGATGGAGTGGGACAGAGAAATTAACAATTACACAAGC 

or an equivalent thereof. 30 

40. An expression vector according to claim 37 
wherein said gene coding for an envelope protein 
of an AIDS virus is a gene comprising the 
nucleotide sequence: 35 

ATGAGACAAGCACATTGTAACATTAGTAGAGCAAAATGGAATGCCACTTTAAAACAGATAGCTAGC 
AAATTAAGAGAACAATTTGGAAATAATAAAACAATAATCTTTAA 
ATTGTAAGGCACAGTTTTAATTGTGGAGGGGAATTTTTC 
ACTTGGTTTAATAGTACTTGGAGTACTGAAGGGTCIAAATAACACTC 
CCATGCAGAATAAAACAATTTATAAACATGTGGCAGGAAGTAGGAAAAGCAATGT^ 
AGCGGACAAATTAGATGTTCATCAAATATTACAGGGCTGCTATTAACAAGAGATGGTGGTAATA^ 
AATCGGTCCtSAGATCTTCAGACCT^AC^AGGAGATATGAGGGAC^TT^ 
TATAAAGTAGTAAAAATTGAACCATTAGGAGTAGCACCCACCAAGGC^ 

gaaaaaagagcagtgggaataggagctttcttcc7tgggttcttgggagcagcaggaagca 

gcagcgtcaatgacgctgacggtacaggccagacaattattgtctggtat^^ 

ttgctgagggctattgaggcgcaacaggatctgttgcaactcacagtct^ 

ctcaagaatcctggctgtggaaagatacctaaaggatcaacagctcctggggattt^ 

aaactaatttgcaccactgctgtgccttggaatgctagttg^ 

aatcacacgacgtggatggagtgggacagagaaattaacaattacacaagc 
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other AIDS viral proteins. 

35. An expression vector comprising a gene coding 
for an envelope protein of an AIDS virus whenever 
prepared by a process as claimed in any one of 
claims 13 to 20. 

36. A transformant carrying an expression vector 
comprising a gene coding for an envelope protein 
of an AIDS virus whenever prepared by a process 
as claimed in any one of claims 21 to 23. 

37. An expression vector comprising a gene coding 
for an envelope protein of an AIDS virus down- 
stream of a promoter sequence enabling transcrip- 
tion, translation and thus expression of said en- 
velope protein in a host cell. 

38. An expression vector according to claim 37 
wherein said gene coding for an envelope protein 
of an AIDS virus is a gene comprising the 
nucleotide sequence: 



GTGTGGAAGGAAGCA 

ACCACCACTCTATTTTGTGCATCAGATGCTAAAGCATATGAT-ACAGAGGTACATAATGTTTGGGCCACA 
CATGCCTGTGTACCCACAGACCCCAACCCACAAGAAGTAGTATTGGTAAATGTGACAGAAAATTTTAAC 
ATGTGGAAAAATGACATGGTAGAACAGATGCATGAGGATATAATCAGTTTATGGGATCAAAGCCTAAAG 
CCATGTGTAAAATTAACCCCACTCTGTGTTAGTTTAAAGTGCACTGATTTGAAGAATGATACTAATACC 
AATAGTAGTAGCGGGAGAATGATAATGGAGAAAGGAGAGATAAAAAACTGCTCTTTCAATATCAGCACA 
AGCATAAGAGGTAAGGTGCAGAAAGAATATGCATTTTTTTATAAACTTGATATAATACCAATAGATAAT 
GATACTACCAGCTATACGTTGACAAGTTGTAACACCTCAGTCATTACACAGGCCTGTCCAAAGGTATCC 
TTTGAGCCAATTCCCATACATTATTGTGCCCCGGCTGGTTTTGCGATTCTAAAATGTAATAATAAGACG 
TTCAATGGAACAGGACCATGTACAAATGTCAGCACAGTACAATGTACACATGGAATTAGGCCAGTAGTA 
TCAACTCAACTGCTGTTAAATGGCAGTCTAGCAGAAGAAGAGGTAGTAATTAGATCTGTCAATTTCACG 
GACAATGCTAAAACCATAATAGTACAGCTGAACACATCTGTAGAAATTAATTGTACAAGACCCAACAAC 
AATACAAGAAAAAAAATCCGTATCCAGAGGGGACCAGGGAGAGCATTTGTTACAATAGGAAAAATAGGA 
AATATGAGACAAGCACATTGTAACATTAGTAGAGCAAAATGGAATGCCACTTTAAAACAGATAGCTAGC 
AAATTAAGAGAACAATTTGGAAATAATAAAACAATAATCTTTAAGCAATCCTCAGGAGGGGACCCAGAA 
ATTGTAACGCACAGTTTTAATTGTGGAGGGGAATTTTTCTACTGTAATTCAACACAACTGTTTAATAGT 
ACTTGGTTTAATAGTACTTGGAGTACTGAAGGGTCAAATAACACTGAAGGAAGTGACACAATCACACTC 
CCATGCAGAATAAAACAATTTATAAACATGTGGCAGGAAGTAGGAAAAGCAATGTATGCCCCTCCCATC 
AGCGGACAAATTAGATGTTCATCAAATATTACAGGGCTGCTATTAACAAGAGATGGTGGTAATAACAAC 
AATGGGTCCGAGATCTTCAGACCTGGAGGAGGAGATATGAGGGACAATTGGAGAAGTGAATTATATAAA 
TATAAAGTAGTAAAAATTGAACCATTAGGAGTAGCACCCACCAAGGCAAAGAGAAGAGTGGTGCAGAGA 
GAAAAAAGAGCAGTGGGAATAGGAGCTTTGTTCCTTGGGTTCTTGGGAGCAGCAGGAAGCACTATGGGC 
GCAGCGTCAATGACGCTGACGGTACAGGCCAGACAATTATTGTCTGGTATAGTGCAGCAGCAGAACAAT 
TTGCTG AGGGCT ATTGAGGCGC AACAGC AT CTGTTGCAACTC ACAG TCTGGGGC A TC AAGC AGC TC C AG 
GCAAGAATCCTGGCTGTGGAAAGATACCTAAAGGATCAACAGCTCCTG<jGGATTTGGGGTTGCTCTGGA 
AAACTAATTTGCACCACTGCTGTGCCTTGGAATGCTAGTTGGAGTAATAAATCTCTGGAACAGATTTGG 
AATCACACGACGTGGATGGAGTGGGACAGAGAAATTAACAATTACACAAGC 



antigen in the sample and a protein obtained ac- 
cording to 'any one of claims 1 to 12 in labeled 
form compete with an antibody against a protein 
obtained according to any one of claims 1 to 12. 

5 

30. A process according to claim 28, wherein a 
sandwich method is performed using two anti- 
bodies against a protein obtained according to any 
one of claims 1 to 12. 

w 

31. A method according to claim 30, wherein one 
antibody is on a solid phase and the other antibody 
is labeled. 

32. A method according to claim 30, wherein two is 
different monoclonal antibodies are used. 

33. An envelope protein of an AIDS virus whenever 
prepared by a process &o claimed in any one of 
claims 1 to 12. 20 

34. An envelope protein of an AIDS virus according 
to claim 33 that is a homogeneous protein free of 
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ATGTATGCCCCTCCCATC 

AGOGGACAAATTAGATGTTCATCAAATATTACAGGGCTGCTA 

AATGGGTCCGAGATCTTCAGACCTGGAGGAGGAGATATGAGGGACAATTGGAGAAGTGAATTATATAAA 

TATAAAGTAGTAAAAATTGAACCATTAGGAGTAGCACGCACCAAGGC^ 

GAAAAAAGAGCAGTGGGAATAGGAGCTTTGTTCCTTGGGTTCTTG 

GCAGCGTCAATGACGCTGACGGTACAGGCCAGACAATTA1TTGTC 

TTGCTGAGGGCTATTGAGGCGCAACAGCATCTGTTGCAAC^ 

GCAAGAATCCTGGCTGTGGAAAGATACCTAAAGGATCAACAGOTCCTGGGG 

AAACTAATTTGCACCACTGCTGTGCCTTGGAATGCTAGTTGGAGTAATAAATC 

AAT C AC ACG ACGTGG ATGG AGTGGG ACAG AG AAA77T AACAAT T ACAC AAGC 



or an equivalent thereof is used. 

18. A process according to claim 13 characterized 
in that as a gene coding for an envelope protein of 
an AIDS virus a gene comprising the nucleotide 
sequence 



75 



ATGAGGGACAATTGGAGAAGTGAATTATATAAA 
TATAAAGTAGTAAAAATTGAACCATTAGGAGTAGCACCCACCAAGGCAAAGAGAAGAGTGGTGCAGAGA 
GAAAAAAGAGCAGTGGCAATAGGAGCTTTGTTCCTTGGGTTCTTGGGAGCAGCAGG 
GCAG03TCAATGACGCTGACGGTACAGGCCAG 

TTGCTGAGG^KTTATTGAGGCGCAACAGCATCTGTTGCAACTCA^ 
GCAAGAATCCTGGCTGTGGAAAGATACCTAAAGGATCAACA^ 
AAACTAATTTGCACCACTGCTGTGCCTTGGAATGCTAGTTGGAGTAATAAAT 
AATCACACGACGTGGATGGAGTGGGACAGAGAAATTAACAATTACACAAGCT 



30 



or an equivalent thereof is used. 

19. A process according to any one of claims 13 to 
18 wherein the expression vector is a plasmid 
capable of replication in gram-negative bacteria. 

20. A process according to claim 19 wherein the 
plasmid is capable of replication in an E. coli strain. 

21 . A process for the preparation of a transform ant 
carrying an expression vector comprising a gene 
coding for an envelope protein of an AIDS virus, 
which process comprises transforming a microor- 
ganism with an expression vector obtained accord- 
ing to any one of claims 13 to 20 and cultivating 
the transformed microorganism. 

22. A process according to claim 21 wherein the 
microorganism is an E. coli strain. 

23. A process according to claim 22 wherein the 
microorganism is an E. coli MC 1061 strain. 

24. A process of testing human blood for the pres- 
ence of antibodies to the viral etiologic agent of 
AIDS which process comprises mixing a composi- 



40 



tion containing an evelope protein of an AIDS virus 
obtained according to any one of claims 1 to 12 
with a sample of human blood and determining 
whether said envelope AIDS protein binds to AJDS 
antibodies present in the blood sample. 

25. A process according to claim 24 which com- 
prises the use of the Western Blotting Analysis. 

26. A process according to claim 24 which com- 
pnses the use of an Elisa-technique, wherein an 
envelope protein of an AIDS virus obtained accord- 
ing to any one of claims 1 to 12 is coated on a 
solid phase and contacted with the sample and 
after washing contacted with an enzyme-labeled 
non-human IgG. 

27. A process according to claim 24. wtierein the 
Double-Antigen-Method is used. 

28. A process for the determination of MDS virus, 
wherein antibodies against ar enve+oo* orotein of 
an AIDS virus obtained acccv: -g ary one of 
claims 1 to 12 are used. 
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TGTCCAAAGGTATCC 

TTTGAGCCAATTCCCATACRTTATTGTGCCCCGGCTGGTTTTGCGATTCTAAAATGTAATAATAAGACG 
TTCAATGGAACAGGACCATGTACAAATGTCAGCACAGTACAATGTACACATGGAATTAGGCCAGTAGTA 
TCAACTCAACTGCTGTTAAATGGCAGTCTAGCAGAAGAAGAGGTAGTAATTAGATCTGTCAATTTCACG 
GACAATGCTAAAACCATAATAGTACAGCTGAACACATCTGTAGAAATTAATTGTACAAGACCCAACAAC 
AATACAAGAAAAAAAATCCGTATCCAGAGGGGACCAGGGAGAGCATTTGTTACAATAGGAAAAATAGGA 
AATATGAGACAAGCACATTGTAACATTAGTAGAGCAAAATGGAATGCCACTTTAAAACAGATAGCTAGC 
AAATTAAGAGAACAATTTGGAAATAATAAAACAATAATCTTTAAGCAATCCTCAGGAGGGGACCCAGAA 
ATTGTAACGCACAGTTTTAATTGTGGAGGGGAATTTTTCTACTGTAATTCAACACAACTGTTTAATAGT 
ACTTGGTTTAATAGTACTTGGAGTACTGAAGGGTCAAATAACACTGAAGGAAGTGACACAATCACACTC 
CCATGCAGAATAAAACAATTTATAAACATGTGGCAGGAAGTAGGAAAAGCAATGTATGCCCCTCCCATC 
AGCGGACAAATTAGATGTTCATCAAATATTACAGGGCTGCTATTAACAAGAGATGGTGGTAATAACAAC 
AATGGGTCCGAGATCTTCAGACCTGGAGGAGGAGATATGAGGGACAATTGGAGAAGTGAATTATATAAA 
TATAAAGTAGTAAAAATTGAACCATTAGGAGTAGCACCCACCAAGGCAAAGAGAAGAGTGGTGCAGAGA 
GAAAAAAGAGCAGTGGGAATAGGAGCTTTGTTCCTTGGGTTCTTGGGAGCAGCAGGAAGCACTATGGGC 
GCAGCGTCAATGACGCTGACGGTACAGGCCAGACAATTATTGTCTGGTATAGTGCAGCAGCAGAACAAT 
TTGCTGAGGGCTATTGAGGCGCAACAGCATCTGTTGCAACTCACAGTCTGGGGCATCAAGCAGCTCCAG 
GCAAGAATCCTGGCTGTGGAAAGATACCTAAAGGATCAACAGCTCCTGGGGATTTGGGGTTGCTCTGGA 
AAACTAATTTGCACCACTGCTGTGCCTTGGAATGCTAGTTGGAGTAATAAATCTCTGGAACAGATTTGG 
AATCACACGACGTGGATGGAGTGGGACAGAGAAATTAACAATTACACAAGC 

or an equivalent thereof is used. 

16. A process according to claim 13 characterized 

in that as a gene coding for an envelope protein of 2 5 
an AIDS virus a gene comprising the nucleotide 
sequence 

ATGAGACAAGCACATTGTAACATTAGTAGAGCAAAATGGAATGCCACTTTAAAACAGATAGCTAGC 
AAATTAAGAGAACAATTTGGAAATAATAAAACAATAATCTTTAAGCAATCCTCAGGAGGGGACCCAGAA 
ATTGTAACGCACAGTTTTAATTGTGGAGGGGAATTTTTCTACTGTAATTCAACACAACTGTTTAATAGT 
ACTTGGTTTAATAGTACTTGGAGTACTGAAGGGTCAAATAACACTGAAGGAAGTGACACAATCACACTC 
CCATGCAGAATAAAACAATTTATAAACATGTGGCAGGAAGTAGGAAAAGCAATGTATGCCCCTCCCATC 
AGCGGACAAATTAGATGTTCATCAAATATTACAGGGCTGCTATTAACAAGAGATGGTGGTAATAACAAC 
AATGGGTCCGAGATCTTCAGACCTGGAGGAGGAGATATGAGGGACAATTGGAGAAGTGAATTATATAAA 
TATAAAGTAGTAAAAATTGAACCATTAGGAGTAGCACCCACCAAGGCAAAGAGAAGAGTGGTGCAGAGA 
GAAAAAAGAGCAGTGGGAATAGGAGCTTTGTTCCTTGGGTTCTTGGGAGCAGCAGGAAGCACTATGGGC 
GCAGCGTCAATGACGCTGACGGTACAGGCCAGACAATTATTGTCTGGTATAGTGCAGCAGCAGAACAAT 
TTGCTGAGGGCTATTGAGGCGCAACAGCATCTGTTGCAACTCACAGTCTGGGGCATCAAGCAGCTCCAG 
GCAAGAATCCTGGCTGTGGAAAGATACCTAAAGGATCAACAGCTCCTGGGGATTTGGGGTTGCTCTGGA 
AAACTAATTTGCACCACTGCTGTGCCTTGGAATGCTAGTTGGAGTAATA3\ATCTCTGGAACAGATTTGG 
AATCACACGACGTGGATGGAGTGGGACAGAGAAATTAACAATTACACAAGC 

or an equivalent thereof is used. 45 

17. A process according to claim 13 characterized 
in that as a gene coding for an envelope protein of 
an AIDS virus a gene comprising the nucleotide 
sequence 50 
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is used. 

8. A process according to any one of claims 1 to 7 
wherein the host ceil is a bacterium. 

9. A process according to claim 8 wherein the 
bacterium is E. coli. 

10. A process according to claim 9 wherein the 
expression vector is a plasmid of the pEV/env 
family. 

11. A process according to claim 10 wherein the 
plasmid is pEV1 , -2, or -3/env 44-640. 

12. A process according to claim 10 wherein the 



plasmid is pEVl , -2, or -3/env 205-640. 

13. A process for the preparation of an expression 
vector comprising a gene coding for an envelope 

5 protein of an AIDS virus, which process comprises 
constructing an expression vector having an inser- 
tion site wherein said gene may be inserted which 
insertion site is downstream of a promoter se- 
quence enabling transcription, translation and thus 

m expression of said envelope protein in a host cell. 

14. A process according to claim 13 characterized 
in that as a gene coding for an envelope protein of 
an AIDS virus a gene comprising the nucleotide 

is sequence 



i\j jRHKj^PJVJ<SA 

ACCACCACTCTATTTTGTCKTATCAGATGCT 
CATCKICTGTGTACCCACAGACCCCAACCCAC 
ATGTGGAAAAATGACATGGTAGAJ\CAGATGC 
CCATGTCTAAAATTAACCCCACTCTGTC 

AATAGTAGTAGCGGGAGAATGATAATGGAGAAAGGAGAGATAAAAAACTGCTCT 

AGCATAAGAGGTAAGGTGCAGAAAGAATATGCATTTTTTTATAAAC 

GATACTACCAGCTATACGTTGACAAGTTGTAACACOT 

TTTGAGCCAATTCCCATACATTATTGTGCCCCGGC 

TTCAATGGAACAGGACCATGTACAAATGTCAGCACAGT^ 

TCAACTCAACTGCTGTTAAATGGCAGTCTAGCAGAAGAAGAGGTAGTAA 

GACAATGCTAAAACCATAATAGTACAGCTGAACACA 

AATACAAGAAAAAAAATCCGTATCCAGAGGGGACCAGGGAGAGCATTTGTTACAATAGGAAA 

AATATGAGACAAGCACATTGTAACATTAGTAGAGCAAAATGGAATGCCACTTTAAAAC^ 

AAATTAAGAGAACAATTTGGAAATAATAAAACAATAATCTTTAAGCAATCCTCAGGAGGGGACC 

ATTGTAACGCACAGTTTTAATTGTGGAGGGGAATTTTTCTACTC 

ACTTGGTTTAATAGTACTTGGAGTACTGAAGGGTC^ 

CCATGCAGAATAAAACAATTTATAAACATGTGGCAGGAAGTAGGAAA^ 

AGCGGACAAATTAGATGTTCATCAAATATTACAGGGCTGCTATTA^ 

AATGGGTCCGAGATCTTCAGACCTGGAGGAGGAGATATGAGGGACAATTGGAGAAGTGAATT 

TATAAAGTAGTAAAAATTGAACCATTAGGAGTAG<^CCCACCAAGGCAAAGAGAAGAGTGGTGCAGAGA 

GAAAAAAGAGCAGTGGGAATAGGAGCTTTGTTCCTTGGGTTCTT^ 

GCAGCGTCAATGACGCTGACGGTACAGGCCAGACAATTATTGTCTGGTATAGTG<^GCAGCAGAACAA 
TTGCTGAGGGCTATTGAGG^AACAGCATCTGTTGCAA^ 

GCAAGAATCCTGGCTGTGGAAAGATACCTAAACK3ATCAACAGCTCCTGGGGATTTGGGGTTGCT 
AAACTAATTTGCACCACTGCTGTGCCTTGGA^ 

AATCACACGACGTGGATGGAGTGGGACAGAGAAATTAACAATTACACAAGC 



or an equivalent thereof is used. 



15. A process according to claim 13 characterized 
in that as a gene coding for an envelope protein of 
an AIDS virus a gene comprising the nucleotide 
sequence 
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is used. AIDS virus a gene capable of effecting expression 

of the protein comprising the amino acid sequence 
5. A process according to ciaim 1 , characterized in of: 
that as a gene coding for an envelope protein of an 

5 



METArgGlnAlaHisCysAsnlleSerArgAlaLysTrpAsnAlaThrLeuLysGlnlleAlaSer 
LysLeuArgGluGlnPheGlyAsnAsnLysThrllellePheLysGlnSerSerGlyGlyAspProGlu 
IleValThrHisSerPheAsnCysGlyGlyGluPhePheTyrCysAsnSerThrGlnLeuPheAsnSer 
ThrTrpPheAsnSerThrTrpSerThrGluGlySerAsnAsnThrGluGlySerAspThrtleThrLeu 
ProCysArglleLysGlnPhelleAsnMETTrpGlnGluValGlyLysAlaMETTyrAlaProProlle 
SerGLyGlnlleArgCysSerSerAsnlleThrGlyLeuLeuLeuThrArgAspGlyGlyAsnAsnAsn 
AsnGlySerGLuIlePheArgProGlyGlyGlyAspMETArgAspAsnTrpArgSerGluLeuTyrLys 
TyrLysValValLyslLeGluProLeuGlyValAlaProThrLysAlaLysArgArgValValGlnArg 
GluLysArgAlaValGlylleGLyAlaLeuPheLeuGLyPheLeuGlyAlaAiaGlySerThrMETGly 
AlaAlaSerMETThrLeuThrValGlnAlaArgGlnLeuLeuSerGlylleValGLnGlnGinAsnAsn 
LeuLeuArgAlalleGLuAlaGlnGlnHisLeuLeuGlnLeuThrValTrpGlylleLysGlnLeuGln 
AlaArglleLeuAlaValGlaArgTyrLeuLysAspGlnGlnUeuLeuGlylleTrpGlyCysSerGly 
LysLeuIleCysThrThrAlaValProTrpAsnAlaSerTrpSerAsnLysSerLeuGluGlnlleTrp 
AsnHisThrThrTrpMETGluTrpAspArgGlulleAsnAsnTyrThrSer 



is used. 

6. A process according to claim 1 , characterized in 
that as a gene coding for an envelope protein of an 



AIDS virus a gene capable of effecting expression 
of the protein comprising the amino acid sequence 
of: 



METTyrAlaProProIle 

SerGlyGlnlleArgCysSerSerAsnlleThrGlyLeuLeuLeuThrArgAspGlyGlyAsnAsnAsn 
AsnGlySerGluIlePheArgProGlyGlyGlyAspMETArgAspAsnTrpArgSerGluLeuTyrLys 
TyrLysValValLysIleGluProLeuGlyValAlaProThrLysAlaLysArgArgValValGlnArg 
GluLysArgAlaValGlylleGlyAlaLeaPheLeuGLyPheLeuGlyAlaAlaGlySerThrMETGly 
AlaAlaSerMETThrLeuThrValGlnAlaArgGlnLeuLeuSerGlylleValGlnGlnGlnAsnAsn 
LeuLeuArgAlalleGluAlaGlnGlnHisLeuLeuGlnLeuThrValTrpGlylleLysGLnLeaGln 
AlaArglleLeuAlaValGluArgTyrLeuLysAspGlnGlnLeuLeuGlylleTrpGlyCysSerGly 
LysLeuIleCysThrThrAlaValProTrpAsnAlaSerTrpSerAsnLysSerLeuGluGlnileTrp 
AsnHisThrThrTrpMETGluTrpAspArgGluIleAsnAsnTyrThrSer 



is used. 

7. A process according to claim 1, characterized in 
that as a gene coding for an envelope protein of an 

45 



AIDS virus a gene capable of effecting expression 
of the protein comprising the amino acid sequence 
of: 



METArgAspAsnTrpArgSerGLuLeuTyrLys 
TyrLysValValLyslLeGLuProLeuGlyValAlaProThrLysAlaLysArgArgValValGlnArg 
GluLysArgAlaValGlylleGlyAlaLeuPheLeuGlyPheLeuGlyAlaAlaGlySerThrMETGLy 
AlaAlaSerMETThrLeuThrValGlnAlaArgGlnLeuLeuSerGlylleVaLGlnGlnGlnAsnAsn 
LeuLeuArgAlalleGluAlaGlnGlnHisLeuLeuGLnLeuThrValTrpGlylleLysGlnLeuGlr. 
AlaArglleLeuAlaValGluArgTyrLeuLysAspGInGlnLeuLeuGlylleTrpGlyCysSerGly 
LysLeulleCysThrThrAlaValProTrpAsnAlaSerTrpSerAsnLysSerLeuGluGlnlleTrp 
AsnHtsThrThrTrpMETGluTrpAspArgGluIleAsnAsnTyrThrSer 
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ValTrpLysGluALa 

ThrThrThrLeuPheCysAlaSerAspAlaLysAUTyrAspThrGluValHisAsnValTrpAlaThr 

HisAlaCysValProThrAspProAsnProGlriGluValValLeuValAsnValThrGluAsnPheAsn 

METTrpLysAsnAspMEWalGluGlnHETHisGluAspIlelleSerLeuTrpAspGlnSerLeuLys 

ProCysValLysLeuThrProLeuCysVaLSerLeuLysCysThrAspLeuLysAsnAspThrAsnThr 

AsnSerSerSerGlyArgMETIleMETGluLysGLyGluIleLysAsnCysSerPheAsnlleSerThr 

SerlleArgGlyLysValGlnLysGluTyrAlaPhePheTyrLysLeuAspIlelleProlLeAspAsn 

AspThrThrSerTyrThrLeuThrSerCysAsnThrSerVallleThrGinAlaCysProLysValSer 

PheGLuProIleProIleHisTyrCysAlaProAlaGLyPheAlalleLeuLysCysAsnAsnLysThr 

PheAsnGlyThrGlyProCysThrAsnValSerThrValGlnCysThrHisGlylLeArgProValVal 

SerThrGlnLeuLeuLeuAsnGlySerLeuAlaGLuGluGluValVallLeArgSerVaLAsnPheThr 

AspAsnAlaLysThrllelleValGlnLeuAsnThrSerValGLuIleAsnCysThrArgProAsnAsn 

AsnThrArgLysLysIleArglLeGlnArgGlyProGlyArgAlaPheValThrlLeGlyLysrieGly 

AsnMETArgGlnAlaHisCysAsnlleSerArgALaLysTrpAsnAlaThrLeuLysGlnlleAlaSer 

LysLeuArgGLuGlnPheGlyAsnAsnLysThrllellePheLysGlnSerSerGlyGl/^spProGlu 

IleValThrHisSerPheAsnCysGlyGlyGluPhePheTyrCysAsnSerThrGlnL-^rheAsaSer 

ThrTrpPheAsnSerThrTrpSerThrGluGlySerAsnAsnThrGluGlySerAspT "leThrLea 

ProCysArglleLysGlnPhelleAsnMETTrpGlnGluVaLGlyLysAlaMETTyrA: roProIle 

SerGlyGlnlleArgCysSerSerAsnlleThrGlyLeuLeuLeuThrArgAspGlyG:. .snAsnAsn 

AsnGlySerGluIlePheArgProGLyGlyGlyAspMETArgAspAsnTrpArgSerGi-^-uTyrLys 

TyrLysValValLysIleGLuProLeuGlyValAlaProThrLysAlaLysArgArgValValGlnArg 

GluLysArgAlaValGlylLeGlyAlaLeuPheLeuGlyPheLeuGlyAlaAlaGlySerThrMETGly 

ALaAlaSerMETThrLeuThrValGlnAlaArgGlnLeuLeuSerGlylleValGlnGlnGlnAsnAsn 

LeuLeuArgAlalleGluAlaGlnGLnHisLeuLeuGlnLeuThrValTrpGlylleLysGlnLeuGln 

ALaArglLeLeuAlaValGLuArgTyrLeuLysAspGlnGlnLeuLeuGlylleTrpGlyCYsSerGly 

LysLeuileCysThrThrAlaValProTrpAsnAlaSerTrpSerAsnLysSerLeuGluGlnlleTrp 

AsnHisThrThrTrpMETGluTrpAspArgGluIleAsnAsnTyrThrSer 

' S US6d " A,DS vir us a gene capable of effecting expression 

30 of the protein comprising the amino acid sequence 

4. A process according to claim 1, characterized in of: 
that as a gene coding for an envelope protein of an 

. , CysProLysValSer 
PheGluProlLeProlLeKi-TyrCysAlaPrc aGlyPheAlaIle\.^LysCysAsrmsnLysThr 
PheAsnGlyThrGlyPrcOsThrAsnValSer -ValGlnCysThr- ^GlylleArgProValVal 
SerThrGlnLeuLeuLeur,3nGlySerLeuA:., -GLuGluValVaL / ^ArgSerValAsnPheThr 
AspAsnAlaLysThrlLell^/alGinLeuAs- rSerValGluIleA. . , JysThrArgProAsnAsn 
AsnThrArgLysLysIleArglleGlnArgGli^rcOlyArgALaPheVaimrlleGlyLysrieGly 
AsnMETArgGlnAlaHisCysAsnlleSerArgAlaLysTrpAsnAlaThrLeuLysGlnlleAlaSer 
LysLeuArgGluGLnPheGlyAsnAsnLysThrllellePheLysGlnSerSerGlyGlyAspProGlu 
IleValThrHisSerPheAsnCysGLyGlyGluPhePheTyrCysAsnSerThrGlnLeuPheAsnSer 
ThrTrpPheAsnSerThrTrpSerThrGluGLySerAsnAsnThrGLuGlySerAspThrlleThrLeu 
ProCysArgXleLysGlnPhelleAsnMETTrpGlnGluValGlyLysAlaMETTyrAlaProProIle 
SerGlyGlnlleArgCysSerSerAsnlleThrGlyLeuLeuLeuThrArgAspGlyGlyAsnAsnAsn 
AsnGLySerGluIlePheArgProGlyGlyGlyAspHETArgAspAsnTrpArgSerGluLeuTyrLys 
TyrLysValValLysIleGluProLeuGlyValAlaProThrLysAlaLysArgArgValVaLGlnArg 
GluLysArgAlaValGlylleGlyAlaLeuPheLeuGlyPheLeuGlyAlaAlaGLySerThrMETGly 
AlaAlaSerMEtThrLeuThrVaLGLnAlaArgGlnLeuLeuSerGlylleVaiGLnGlnGlnAsnAsn 
LeuLeuArgAlalleGluAlaGlnGlnHisLeuLeuGlnLeuThrValTrpGlylleLysGlnLeuGln 
ALaArglleLeuAlaValGluArgTyrLeuLysAspGlnGlnLeuLeuGlylLeTrpGlyCysSerGLy 
LysLeuIleCysThrThrALaValProTrpAsnALaSerTrpSerAsnLysSerLeuGluGlnlleTrp 
AsnHtsThrThrTrpMETGluTrpAspArgGluIIeAsnAsnTyrThrSer 
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METArg 

ValLysGluLysTyrGlnHisLeuTrpArgTrpGlyTrpArgTrpGiyThrHETLeuLeuGlyMETLeu 

METIleCysSerAlaThrGluLysLeuTrpVaLThrValTyrTyrGlyValProValTrpLysGluAla 

ThrThrThrLeuPheCysAlaSerAspAlaLysAlaTyrAspThrGluValHisAsnVaLTrpAlaThr 

HisAlaCysValProThrAspProAsnProGLnGluValValLeuValAsnValThrGluAsnPheAsn 

METTrpLysAsnAspMETValGlaGlnMETHisGluAspI lelleSerLeuTrpAspGlnSerLeuLys 

ProCysValLysLeuThrProLeuCysValSerLeuLysCysThrAspLeuLysAsnAspThrAsnThr 

AsnSerSerSerGlyArgMETIleMETGluLysGlyGluIleLysAsnCysSerPheAsnlleSerThr 

SerlleArgGlyLysValGlnLysGluTyrAlaPhePheTyrLysLeuAspllelleProIleAspAsn 

AspThrThrSerTyrThrLeuThrSerCysAsnThrSerVallleThrGlnAlaCysProLysValSer 

PheGluProIleProIleHisTyrCysAlaProAlaGlyPheAlalleLeuLysCysAsaAsnLysThr 

PheAsnGlyThrGlyProCysThrAsnValSerThrValGlnCysThrHisGlylleArgProValVal 

SerThrGlnLeaLeuLeuAsnGlySerLeuAlaGluGlLiGluValVallleArgSerValAsnPheThr 

AspAsnAlaLysThrllelleValGlnLeuAsnThrSerValGlulleAsnCysThrArgProAsnAsn 

AsnThrArgLysLysIleArglleGlnArgGlyProGlyArgAlaPheValThrlleGlyLysIleGly 

AsnMETArgGlnAlaHisCysAsnlleSerArgAlaLysTrpAsnAlaThrLeuLysGlnlleAlaSer 

LysLeuArgGluGlnPheGlyAsnAsnLysThrllellePheLysGLnSerSerGlyGLyAspProGla 

IleValThrHisSerPheAsnCysGlyGlyGluPhePheTyrCysAsnSerThrGlnLeuPheAsnSer 

ThrTrpPheAsnSerThrTrpSerThrGluGlySerAsnAsnThrGluGlySerAspThrlleThrLeu 

ProCysArglLeLysGlnPhelleAsnMETTrpGlnGluValGlyLysAlaMETTyrAlaProProIle 

SerGlyGlnlleArgCysSerSerAsnlleThrGlyLeuLeuLeuThrArgAspGlyGLyAsnAsnAsn 

AsnGlySerGluIlePheArgProGlyGlyGlyAspMETArgAspAsnTrpArgSerGluLeuTyrLys 

TyrLysValValLysIleGluProLeuGlyvalAlaProThrLysAlaLysArgArgValValGlnArg 

GLuLysArgALaValGlylleGlyAlaLeuPheLeuGlyPheLeuGlyAlaAlaGlySerThrMETGly 

AlaAlaSerMETThrLeuThrValGlnAlaArgGlnLeuLeuSerGlylleValGlnGlnGlnAsnAsn 

LeuLeuArgAlalleGluALaGlnGLnHisLeuLeuGlnLeuThrValTrpGlylleLysGlnLeuGln 

AlaArglleLeuAlaValGluArgTyrLeuLysAspGlnGlnLeuLeuGlylleTrpGlyCysSerGly 

LysLeuIleCysThrThrAlaValProTrpAsnAlaSerTrpSerAsriLysSerLeuGluGlnlleTrp 

AsnHisThrThrTrpMETGluTrpAspArgGluIleAsnAsnTyrThrSerLeuIleHisSerLeuIle 

GluGluSerGlnAsnGlnGlnGluLysAsnGluGlnGluLeuLeuGluLeuAspLysTrpA laser Leu 

TrpAsnTrpPheAsnileThrAsnTrpLeuTrpTyrlleLysLeuPhelleMETIleValGlyGlyLeu 

ValGlyLeuArglleVaLPheAlaValLeuSerValValAsnArgValArgGlnGlyTyrSerProLeu 

SerPheGlnThrHisLeuProIleProArgGlyProAspArgProGluGlylleGluGluGluGlyGly 

GluArgAspArgAspArgSerlleArgLeuValAsnGlySerLeuAlaLeuIleTrpAspAspLeuArg 

SerLeuCysLeuPheSerTyrHisArgLeuArgAspLeuLeuLeuIleValThrArglleValGluLeu 

LeuGlyArgArgGlyTrpGluAlaLeuLysTyrTrpTrpAsnLeuLeuGlnTyrTrpSerGlnGluLeu 

LysAsnSerAlaValSerLeuLeuAsnALaThrAlalleALaValAlaGluGlyThrAspArgVallLe 

GluValValGlnGluALaTyrArgAlalleArgHisIleProArgArglleArgGLnGlyLeuGluArg 
IleLeuLeu 



is used. 

3. A process according to claim 1 , characterized in 
that as a gene coding for an envelope protein of an 



AIDS virus a gene capable of effecting expression 
of the protein comprising the amino acid sequence 
of: 



so 
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ence of antibodies to the viral etiologic agent of 
AIDS which comprises mixing a composition con- 
taining an envelope protein of an AIDS virus as 
claimed in any one of claims 1 to 8 with a sample 
of human blood and determining whether said en- 5 
velope AIDS protein binds to AIDS antibodies 
present in the blood sample. 

30. A method according to claim 29 which com- 
prises the use of the Western Blotting Analysis. 10 

31. A method according to claim 29 which com- 
prises the use of an ELfSA-technique, wherein an 
envelope protein of an AIDS virus as claimed in 

any one of claims 1 to 8 is coated on a solid phase is 
and contacted with the sample and after washing 
contacted with an enzyme-labeled non-human igG. 

32. A method according to claim 29, wherein the 
Double-Antigen-Method is used. 20 

33. A method for the determination of AIDS virus, 
wherein antibodies against an envelope protein of 
an AIDS virus according to any one of claims 1 to 

8 are used. 25 

34. A method according to claim 33, wherein the 
antigen in the sample and a protein as claimed in 
any one of claims 1 to 8 in labeled form compete 

with an antibody against a protein as claimed in 30 
any one of claims 1 to 8. 

35. A method according to claim 33, wherein a 
sandwich method is performed using two anti- 
bodies against a protein as claimed in any one of 35 

r.a.ms 1 to 8. 

_ A method according to claim 35, wherein one 
an::Dody is on a solid phase and the other antibody 
is labeled. ^ 

37. A method according to claim 35, wherein two 
different monoclonal antibodies are used. 



38. A vaccine eliciting immunity to AJDS compris- 
ing as an active ingredient a protein as claimed in 
any one of claims 1 to 8. 

39. Antibodies raised against a protein as claimed 
in any one of claims 1 to 8. 

40. The antibodies of claim 39 which are mon- 
oclonal antibodies. 

41. The use of a protein as claimed in anyone of 
claims 1-8 for the preparation of a protective im- 
munisation vaccine. 

42. The use of a protein as claimed in any one of 
clams 1 to 8 for testing human blood for the 
presence of AIDS virus. 

Claims for Austria: 

1. A process for the preparation of an envelope 
protein of an acquired immune deficiency syn- 
drome (AIDS) virus comprising: 

transforming a host cell with an expression vector 
comprising a gene coding for an envelope protein 
of an AIDS virus downstream of a promoter se- 
quence enabling transcription, translation and ex- 
pression of said envelope protein in said host cell; 

culturing said host cell so that said envelope pro- 
tein of an AIDS virus is expressed; and 

extracting and isolating said envelope protein of an 
AIDS virus. 

2. A process according to claim 1, characterized in 
that as a gene coding for an envelope protein of an 
AIDS virus a gene capable of effecting expression 
of the protein comprising the amino acid sequence 
of: 



50 



22 



39 



0 199 301 



40 



ATGTATGCCCCTCCCATC 

AGCGGACAAATTAGATGTTCATCAAATATTACAGGGCTGCTATTAACAAGAGATGGTGGTAATAACAAC 
AATG<5GTCCGAGATCTTCAGACCTGGAGGAGGAGATATGAGGGACAATTG<3AGAAGTGAATTATATAAA 
TATAAAGTAGTAAAAATTGAACCATTAGGAGTAGCACCCACCAAGGCAAAGAGAAGAGTGGTGCAGAGA 
GAAAAAAGAGCAGTGGGAATAGGAGCTTTGTTCCTTGGGTTCTTGGGAGCAGCAGGAAGCACTATGGGC 
GCAGCGTCAATGACGCTGACGGTACAGGCCAGACAATTATTGTCTGGTATAGTGCAGCAGCAGAACAAT 
TTGCTGAGGGCTATTGAGGCGCAACAGCATCTGTTGCAACTCACAGTCTGGGGCATCAAGCAGCTCCAG 
GC AAGAATCCTGGCTGTGGAAAGATACCTAAAGGATCAACAGCTCCTGGGGATTTGGGGTTGCTCTGGA 
AAACTAATTTGCACCACTGCTGTGCCTTGGAATGCTAGTTGGAGTAATAAATCTCTGGAACAGATTTGG 
AATCACACGACGTGGATGGAGTGGGACAGAGAAATTAACAATTACACAAGC 

or an equivalent thereof. 

14, An expression vector according to claim 9 
wherein said gene coding for an envelope protein '5 
of an AIDS virus is a gene comprising the 
nucleotide sequence: 

ATGAGGGACAATTGGAGAAGTGAATTATATAAA 
TATAAAGTAGTAAAAATTGAACCATTAGGAGTAGCACCCACCAAGGCAAAGAGAAGAGTGGTGCAGAGA 
GAAAAAAGAGCAGTGGGAATAGGAGCTTTGTTCCTTGGGTTCTTGGGAGCAGCAGGAAGCACTATGGGC 
GCAGCGTCAATGACGCTGACGGTACAGGCCAGACAATTATTGTCTGGTATAGTGCAGCAGCAGAACAAT 
TTGCTGAGGGCTATTGAGGCGCAACAGCATCTGTTGCAACTCACAGTCTGGGGCATCAAGCAGCTCCAG 
GCAAGAATCCTGGCTGTGGAAAGATACCTAAAGGATCAACAGCTCCTGGGGATTTGGGGTTGCTCTGGA 
AAACTAATTTGCACCACTGCTGTGCCTTGGAATGCTAGTTGGAGTAATAAATCTCTGGAACAGATTTGG 
AATCACACGACGTGGATGGAGTGGGACAGAGAAATTAACAATTACACAAGCT 



15. An expression vector according to any one of 
claims 9 to 14 which is a plasmid capable of 
replication in gram-negative and/or gram-positive 
bacteria. 

16. An expression vector according to claim 15 
which is capable of replication in an E. coli strain. 

17. An expression vector according to claim 15 
which is capable of replication in a B. subtilis 
strain. 

18. An expression vector according to claim 15 or 
16 which is a member of the pEV/env family. 

19. An e>pression vector according to claim 18 
which is pEV1 . -2. or -3/env 44-640. 

20. An expression vector according to claim 18 
which is pEVl, -2, or -3/env 205-640. 

21 . A transformant carrying an expression vector as 
claimed in any one of claims 9 to 20. 

22. A transformant according to claim 21 which is 
an E. coli strain. 



30 
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23. A transformant according to claim 22 which is 
an E. coli MC 1061 strain. 

24. A transformant according to claim 21 which is a 
B. subtilis strain. 

25. A transformant according to claim 21 which is a 
eucaryotic cell. 

26. A method of producing an envelope protein of 
an acquired immune deficiency syndrome virus as 
claimed in any one of claims 1 to 8 comprising: 

transforming a host cell with an expression vector 
as claimed in any one of claims 9 to 20; 

culturing said host cell so that said AIDS env 
protein is expressed; and, 

extracting and isolating said AIDS env protein. 

27. A method according to claim 26 wherein the 
expression vector is pEV1, -2 or -3/env 44-640. 

28. A method according to claim 26 wherein the 
expression vector is pEV1. -2 or -3/env 205-640 

29. A method of testing human blood for the ores- 



21 



37 



0 199 301 



38 



TGTCCAAAGGTATCC 

TTTGAGCCAATTCCCATACATTATTGTGCCCC^ 
TTCAATGGAACAGGACCATGTACAAATGTCAGCACAGTACA^ 

TCAACTCAACTGCTGTTAAATGGCAGTCTAGCAGAAGAAGAGGTAGTAATTAGATCTGTCAATTTCACG 
GACAATGCTAAAACCATAATAGTACAGCKAACACATCTGTAGAAATTA^ 

AATACAAGAAAAAAAATCCGTATCCAGAGGGGACCAGGGAGAGCATTTGTTACAATAGGAAAAATAGGA 
AATATGAGACAAGCACATTGTAACATTAGTAGAGCAAAATGGAAT^^ 

AAATTAAGAGAACAATTTGGAAATAATAAAACAATAATCTTTAAGCAATCCTCAGGAGGGGA 

AT TGT AACGC AC AGTTTT AAT TGTGG AGGGG AATTTTTCT ACTGT AATTCAAC ACAACTGTTT AAT AGT 

ACTTGGTTTAATAGTACTTGGAGTACTGAAG<3GTCAAATAACAC^ 

CCATGCAGAATAAAACAATTTATAAACATGTGGCAGGAAGTAGGAAAAGCA^ 

AGCGGACAAATTAGATGTTCATCAAATATTACAGGGCTGCTA 

AATGGGTCCGAGATCTTCAGACCTGGAGGAGGAGATATGAGGGACAATTGGAGAAGTC 

TATAAAGTAGTAAAAATTGAACCATTAGGAGTAGGAC 

GAAAAAAGAGCAGTGGGAATAGGAGCTTTGTTCCTTGGGTTCTTGG^ 

gcag<xtcaatgacgctgacggtacaggccagacaatt^^ 
ttgctgagcxx^tattgaggcgcaacagcatctgtt^ 
gcaagaatcctggctgtggaaagatacctaaaggatcaacagctc^ 
aaactaatttgcaccactgctgtgccttggaat^^ 
aatcacacgacgtggatggagtgggacagagaaattaacaattacacaagc ^ 

or an equivalent thereof . 

12. An expression vector according to claim 9 
wherein said gene coding for an envelope protein 25 
of an AIDS virus is a gene comprising the 
nucleotide sequence: 



ATGAGACAAGCACATTGTAACATTAGTAGAGCAAAATGGAATGCCACTTTAAAACAGATAGCTAGC 
AAATTAAGAGAACAATTTGGAAATAATAAAACAATAATC^ 
ATTGTAACGCACAGTTTTAATTGTGGAGGGGAATTTTTCT 
ACTTGGTTTAATAGTACTTGGAGTACTCAAGGGTCAAATAACAC 
CCATGCAGAATAAAACAATTTATAAACATGTGGCAGGAAGTAG^ 

AGCGGACAAATTAGATGTTCATCAAATATTACAGGGCTGCTATTAACAAGAGATGGTGGTAATAACAAC 

aatgggtccgagatcttcagacctggaggaggagatatgagg<;acaattggagaagtgaattatataaa 
tataaagtagtaaaaattgaaccattaggagtagcaccca^ 

GAAAAAAGAGCAGTGGGAATAGGAGCTTTGTTCCTTGGGTTCTTGG^ 
GCAGCGTCAATGACGCTGACGGTACAGGCCAGACAATTATTC 
TTGCTGACK^GCTATTGAGGCGCAACACKZATCTGTTGCAACT^ 
GC AAGAATCCTGGCTGTGGAAAGATACCTAAAGGATCAACAGC^ 

AAACTAATTTGCACCACTGCTGTGCCTTGGAATGCTAGTTG^ 
AATCACACGACGTGGATGGAGTGGGACAGAGAAATTAACAATTACACAAGC 



or an equivalent thereof. 

13. An expression vector according to claim 9 
wherein said gene coding for an envelope protein 
of an AIDS virus is a gene comprising the 
nucleotide sequence: 
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10. An expression vector according to claim 9 
wherein said gene coding for an envelope protein 
of an AIDS virus is a gene comprising the 
nucleotide sequence: 

5 

G TGTGG AAGG AAGC A 

ACCACCACTCTATTTTGTGCATCAGATGCTAAAGCATATGATACAGAGGTACATAATGTTTGGGCCACA 
CATGCCTGTGTACCCACAGACCCCAACCCACAAGAAGTAGTATTGGTAAATGTGACAGAAAATTTTAAC 
ATGTGGAAAAATGACATGGTAGAACAGATGCATGAGGATATAATCAGTTTATGGGATCAAAGCCTAAAG 
CCATGTGTAAAATTAACCCCACTCTGTGTTAGTTTAAAGTGCACTGATTTGAAGAATGATACTAATACC 
AATAGTAGTAGCGGGAGAATGATAATGGAGAAAGGAGAGATAAAAAACTGCTCTTTCAATATCAGCACA 
AGCATAAGAGGTAAGGTGCAGAAAGAATATGCATTTTTTTATAAACTTGATATAATACCAATAGATAAT 
GATACTACCAGCTATACGTTGACAAGTTGTAACACCTCAGTCATTACACAGGCCTGTCCAAAGGTATCC 
TTTGAGCCAATTCCCATACATTATTGTGCCCCGGCTGGTTTTGCXJATTCTAAAATGTAATAATAAGACG 
TTCAATGGAACAGGACCATGTACAAATGTCAGCACAGTACAATGTACACATGGAATTAGGCCAGTAGTA 
TCAACTCAACTGCTGTTAAATGGCAGTCTAGCAGAAGAAGAGGTAGTAATTAGATCTGTCAATTTCACG 
GACAATGCTAAAACCATAATAGTACAGCTGAACACATCTGTAGAAATTAATTGTACAAGACCCAACAAC 
AATACAAGAAAAAAAATCCGTATCCAGAGGGGACCAGGGAGAGCATTTGTTACAATAGGAAAAATAGGA 
AATATGAGACAAGCACATTGTAACATTAGTAGAGCAAAATGGAATGCCACTTTAAAACAGATAGCTAGC 
AAATTAAGAGAACAATTTGGAAATAATAAAACAATAATCTTTAAGCAATCCTCAGGAGGGGACCCAGAA 
ATTGTAACGCACAGTTTTAATTGTGGAGGGGAATTTTTCTACTGTAATTCAACACAACTGTTTAATAGT 
ACTTGGTTTAATAGTACTTGGAGTACTGAAGGGTCAAATAACACTGAAGGAAGTGACACAATCACACTC 
CCATGCAGAATAAAACAATTTATAAACATGTGGCAGGAAGTAGGAAAAGCAATGTATGCCCCTCCCATC 
AGCGGACAAATTAGATGTTCATCAAATATTACAGGGCTGCTATTAACAAGAGATGGTGGTAATAACAAC 
AATGGGTCCGAGATCTTCAGACCTGGAGGAGGAGATATGAGGGACAATTGGAGAAGTGAATTATATAAA 
TATAAAGTAGTAAAAATTGAACCATTAGGAGTAGCACCCACCAAGGCAAAGAGAAGAGTGGTGCAGAGA 
GAAAAAAGAGCAGTGGGAATAGGAGCTTTGTTCCTTGGGTTCTTGGGAGCAGCAGGAAGCACTATGGGC 
GCAGCGTCAATGACGCTGACGGTACAGGCCAGACAATTATTGTCTGGTATAGTGCAGCAGCAGAACAAT 
TTGCTGAGGGCTATTGAGGCGCAACAGCATCTGTTGCAACTCACAGTCTGGGGCATCAAGCAGCTCCAG 
GCAAGAATCCTGGCTGTGGAAAGATACCTAAAGGATCAACAGCTCCTGGGGATTTGGGGTTGCTCTGGA 
AAACTAATTTGCACCACTGCTGTGCCTTGGAATGCTAGTTGGAGTAATAAATCTCTGGAACAGATTTGG 
AATCACACGACGTGGATGGAGTGGGACAGAGAAATTAACAATTACACAAGC 

35 

or an equivalent thereof. 

11. An expression vector according to claim 9 
wherein said gene coding for an envelope protein 
of an AIDS virus is a gene comprising the 4Q 
nucleotide sequence: 
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5- An envelope protein of an AIDS virus according 
to claim 1 with the amino acid sequence: 



METArgGlnAlaHisCysAsnlleSerArgAlaLysTrpAsnAlaThrLeuLysGinlleAlaSer 
LysLeoArgGLuGLnPheGlyAsnAsnLysThrllellePheLysGlaSerSerGlyGiyAspProGlu 
IleValThrHisSerPheAsnCysGlyGlyGLuPhePheTyrCysAsnSerThrGlaLeuPheAsnSer 
ThrTrpPheAsnSerThrTrpSerThrGluGlySerAsnAsnThrGluGlySerAspThrlleThrLeu 
Prc^sArglleLysGLnPhelleAsnMETTrpGlnGluValGLyLysAldMETTyrAlaProProIle 
SerGLyGlnrieArgCysSerSerAsnlleThrGlyLeuLeuLeuThrArgAspGlyGlyAsnAsnAsn 
AsnGlySerGluIlePheArgProGlyGlyGLyAspMETArgAspAsnTrpArgSerGluLeuTyrLys 
TyrLysVaLValLyslLeGiuProLeuGlyValALaProThrLysAlaLysArgArgValValGlnArg 
GluLysArgAlaVaLGlylLeGlyAlaLeuPheLeuGlyPheLeuGlyAlaAUGlySerThrMETGly 
AlaALaSerMETThrLeuThrValGlnAlaArgGlnLeuLeuSerGlylleValGlnGlnGlnAsnAsn 
LeuLeuArgAlalleGluAlaGlnGlnHisLeuLeuGlnLeuThrValTrpGlyiieLysGlnLeuGLn 
AlaArglleLeaALaValGLuArgTyrLeuLysAspGlnGLnLeuLeuGlylleTrpGlyCysSerGLy 
LysLeuIleCysThrThrAlaVaLProTrpAsnALaSerTrpSerAsnLysSerLeuGluGlnlleTrp 
AsnHisThrThrTrpMETGluTrpAspArgGluILeAsnAsaTyrThrSer 



6. An envelope protein of an AIDS virus according 
to claim 1 with the amino acid sequence: 

25 



„ ^ _ n , METTyrAlaProProIle 
SerGlyGlnlleArgCysSerSerAsnlleThrGlyLeuLeuLeuThrArgAspGlyGLyAsnAsnAsn 
AsnGLySerGlulUPheArgProGlyGlyGlyAspMETArgAspAsnTrpArgSerGluLeuTyrLys 
TyrLysValValLyslleGloProLeuGlyValAlaProThrLysAlaLysArgArgValValGlnArg 
GluLysArgAlaValGlylleGlyAlaLeuPheLeuGlyPheLeuGlyAlaAlaGlySerThrMETGly 
AlaAlaSerMETThrLeuThrValGlnAlaArgGlaLeaLeuSerGlylleVaLGlnGlnGlnAsnAsn 
LeuLeuArgAlalleGluAlaGlnGlnHisLeuLeuGlnLeuThrVaLTrpGLylleLysGlnLeuGln 
AlaArglleLeuAlaValGLuArgtyrLeuLysAspGlQGinLeuLeuGlyrleTrpGLyCysSerGly 
LysLeuIleCysThrThrAlaValProTrpAsnAlaSerTrpSerAsnLysSerLeuGluGLnlleTrp 
AsnHIsThrThrTrpKETGluTrpAspArgGluIleAsnAsnTyrThrSer 



7. An envelope protein of an AIDS virus according 

to claim 1 with the amino acid sequence: *o 

METArgAspAsnTrpArgSerGLuLeuTy rLys 

TyrLysValVaLLysIleGluProLeuGlyValAlaProThrLysAULysArgArgValValGlriArg 

GluLysArgAlaValGlylleGlyAlaLeuPheLeuGLyPheLeuGlyALaAlaGlySerThrMETGly 
Al^laSerKETThrLeuThrValGl^ 

LeuLeuArgAlalleGluAlaGlnGlnHisLeuLeuGLnLeuThrValTrpGiyTleLysGLnLeuGln 
AlaArglleLeuAlaValGluArgTyrLeuLysAspGlnGlnLeuLeuGlylleTrpGlyCysSerGly 
LysLeuIleCysThrThrAlaValProTrpAsnAlaSerTrpSerAsnLysSerLeuGluGlnlleTrp 
AsnHisThrThrTrpMETGluTrpAspArgGluIleAsoAsnTyrThrSer 

9. An expression vector compnsi^g j ;*ra coding 
for an envelope protein of an a,c$ .<-.% down- 
stream of a promoter sequence v^tt^g ~ inscrip- 
tion, translation and thus expression * \Aid en- 
velope protein in a host cell. 



8. An envelope protein as claimed in any one of 
claims 1 to 7 that is a homogeneous protein free of 
other AIDS viral proteins. 
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ValTrpLysGluAla 

ThrThrThrLeuPheCysAlaSerAspALaLysAlaTyrAspThrGluValHisAsnValTrpAlaThr 

HisAlaCysValProThrAspProAsnProGlnGIuValValLeuVaLAsnValThrGluAsnPheAsn 

METTrpLysAsnAspMETValGluGlnMETHisGluAspIlelleSerLeuTrpAspGlnSerLeuLys 

ProCysValLysLeuThrProLeuCysValSerLeuLysCysThrAspLeuLysAsnAspThrAsnThr 

AsnSerSerSerGlyArgMETIleMETGluLysGlyGluIleLysAsnCysSerPheAsnlleSerThr 

SerlleArgGlyLysValGlnLysGluTyrAlaPhePheTyrLysLeuAspIlelleProIleAspAsn 

AspThrThrSerTyrThrLeuThrSerCysAsnThrSerVallleThrGlnALaCysProLysValSer 

PheGluProIleProIleHisTyrCysAlaProAlaGlyPheAlalleLeuLysCysAsnAsnLysThr 

PheAsnGiyThrGlyProCysThrAsnValserThrValGLnCysThrHisGlylleArgProValVal 

SerThrGlnLeuLeuLeuAsnGlySerLeuAlaGluGluGluValVallleArgSerValAsnPheThr 

AspAsnAlaLysThrllelleValGlnLeuAsnThrSerValGlulleAsnCysThrArgProAsnAsn 

AsnThrArgLysLysIleArglleGlnArgGlyProGLyArgAlaPheValThrlleGlyLysrieGly 

AsnMETArgGlnAlaHisCysAsnlleSerArgAULysTrpAsnAlaThrLeuLysGlnlleAlaSer 

LysLeuArgGluGlnPheGlyAsnAsnLysThrllellePheLysGlnSerSerGlyGlyAspProGlu 

IleValThrHisSerPheAsnCysGlyGlyGluPhePheTyrCysAsnSerThrGlnLeuPheAsnSer 

ThrTrpPheAsnSerThrTrpSerThrGluGlySerAsnAsnThrGluGlySerAspThrlleThrLea 

ProCysArglleLysGlnPhelleAsnMETTrpGlnGluValGlyLysAiaMETTyrAlaProProlle 

SerGlyGlnlleArgCysSerSerAsnlleThrGlyLeuLeuLeuThrArgAspGlyGlyAsnAsnAsn 

AsnGlySerGLuIlePheArgProGlyGlyGlyAspMETArgAspAsnTrpArgSerGluLeuTyrLys 

TyrLysValValLysIleGluProLeuGlyValAlaProThrLysAlaLysArgArgValValGlnArg 

GluLysArgAlaValGlylleG-LyAlaLeuPheLeuGlyPheLeuGXyAlaAlaGlySerThrMETGly 

AlaAlaSerMETThrLeuThrValGlnAlaArgGLnLeuLeuSerGlylleValGlnGlnGlnAsnAsn 

LeuLeuArgAlalleGluAlaGlnGlnHisLeuLeuGlnLeuThrValTrpGlylleLysGlnLeuGln 

AlaArglleLeuAlaValGluArgTyrLeuLysAspGlnGlnLeuLeuGlylleTrpGlyCysSerGly 

LysLeuIleCysThrThrAlaValProTrpAsnAlaSerTrpSerAsnLysSerLeuGluGlnlleTrp 

AsnHisThrThrTrpMETGluTrpAspArgGluIleAsnAsnTyrThrSer 

4. An envelope protein of an AIDS virus according 30 * 
to claim 1 with the amino acid sequence: 



PheGluProlleProIleHisTyrCysAlaProAlaGlyPheAlalleLeuLysCysAsnAsnLysThr 
PheAsnGlyThrGlyProCysThrAsnValSerThrVaLGlnCysThrHisGlyileArgProValVal 
SerThrGlnLeuLeuLeuAsnGlySerLeuAUGluGluGluValVallleArgSerValAsnPheThr 
AspAsnAlaLysThrllelleValGlnLeuAsnThfSerValGluIleAsnCysThrArgProAsnAsn 
AsnThrArgLysLysIleArglleGlnArgGlyProGlyArgAlaPheVaLThrlleGlyLysIleGly 
AsnMETArgGlnAUHisCysAsnlleSerArgAlaLysTrpAsnAlaThrLeuLysGlnlleAlaSer 
LysLeuArgGluGlnPheGlyAsnAsnLysThrllellePheLysGlnSerSerGlyGlyAspProGlu 
IleValThrHisSerPheAsnCysGlyGlyGluPhePheTyrCysAsnSerThrGlrtLeuPheAsnSer 
ThrTrpPheAsnSerThrTrpSerThrGluGlySerAsnAsnThrGluGlySerAspThrlleThrLeu 
ProCysArglleLysGlnPhelleAsoMETTrpGlnGluValGlyLysALaHETTyrAlaProProILe 
SerGlyGlnlleArgCysSerSerAsnlleThrGlyLeuLeuLeuThrArgAspGlyGlyAsnAsnAsn 
AsnGlySerGlullePheArgProGlyGlyGlyAspMETArgAspAsnTrpArgSerGluLeuTyrLys 
TyrLysValValLyslUGluProLeuGlyValAlaProThrLysAlaLysArgArgValValGlnArg 
GluLysArgAlaValGLylleGlyAlaLeuPheLeuGLyPheUeuGlyAlaAUGlySerThrMETGLy 
AlaAlaSerMETThrLeuThrValGlnAlaArgGLnLeuLeuSerGlylleVaLGlnGlnGlriAsnAsn 
LeuLeuArgAlalleGluAlaGlnGlnHisLeuLeuGinLeuThrValTrpGlylleUysGLnLeuGln 
AlaArglleLeuAlaValGluArgTyrLeuLysAspGlnGlnLeuLeuGlylleTrpGlyCysSerGly 
LysLeuIleCysThrThrALaValProTrpAsnAlaSerTrpSerAsnLysSerLeuGLuGlnlleTrp 
AsnHisThrThrTrpMETGluTrpAspArgGluIleAsnAsnTyrThrSer 
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MET Ar g 

VaLLysGluLysTyrGlaHisLeuTrpArgTrpGlyTrpArgTrpGlyThrHETLeuLeiXJlyMETLeLi 
METIleCysSerAlaThrGlaLysLeuTrpValThrVaLTyrTyrGLyValProValTrpLysGluAla 
ThrThrThrLeuPheCysAlaSerAspAlaLysAlaTyrAspThrGluVaLHisAsnVaLTrpAlaThr 
HisAlaCysValProThrAspProAsnProGLnGiuValValLeuValAsnValThrGluAsnPheAsn 
HETTrpLysAsnAspMEWalGluGlnMETHisGluAspIlelleSerLeuTrpAspGLnSerLeuLys 
ProCysValLysLeuThrProLeuCysValSerLeuLysCysThrAspLeuLysAsnAspThrAsnThr 
AsnSerSerSerGlyArgHETrLeMETGluLysGlyGluIleLysAsnCysSerPheAsnlleSerThr 
SerlleArgGlyLysVaLGlnLysGluTyrALaPhePheTyrLysLeuAspIlelleProHeAspAsn 
AspThrThrSerTyrThrLeuThrSerCysAsnThrSerValXLeThrGloAlaCysProLysValSer 
PheGluProIleProIleHisTyrCysAlaProAlaGLyPheAlaHeLeuLysCysAsnAsriLysThr 
PheAsnGlyThrGlyProCysThrAsnValSerThrValGlnCysThrHisGlylleArgProValVal 
SerThrGlnLeuLeuLeuAsnGlySerLeuAlaGluGluGluValVaLlleArgSerValAsnPheThr 
AspAsnAlaLysThrlLelleValGlnLeuAsnThrSerValGluIleAsnCysThrArgProAsnAsn 
AsnThrArgLysLysIleArglleGLnArgGlyProGlyArgAlaPheValThrlleGlyLysrieGly 
AsnMETArgGlnAlaHisCysAsnlleSerArgAlaLysTrpAsnAlaThrLeuLysGlnlleAlaSer 
LysLeuArgGLuGlnPheGlyAsoAsnLysThrllellePheLysGlnSerSerGlyGLyAspProGlu 
IleVaLThrHisSerPheAsnCysGLyGLyGLuPhePheTyrCysAsnSerThrGlnLeuPheAsnSer 
ThrTrpPheAsnSerThrTrpSerThrGluGlySerAsnAsnThrGluGlySerAspThrlleThrLeu 
ProCysArglleLysGlriPhelleAsnMETTrpGlnGluValGlyLysAlaMETTyrAlaProProlle 
SerGLyGlnlLeArgCysSerSerAsnlleThrGlyLeuLeuLeuThrArgAspGlyGlyAsnAsnAsn 
AsnGlySerGLuIlePheArgProGlyGlyGlyAspHETArgAspAsnTrpArgSerGluLeuTyrLys 
TyrLysValValLysIleGluProLeuGlyValAlaProThrLysAlaLysArgArgValValGlnArg 
GluLysArgAlaValGlylleGlyAlaLeuPheLeuGlyPheLeLtGLyAlaALaGlySerThrMETGly 
A i aA 1 as e r METTh r LeuThr Va 1G 1 nA 1 aAr gG InLe uLeuSe rG Ly 1 1 eVa 1G InG 1 nG L nAs nAsn 

LeuLeuArgAlalleGluAlaGlnGlnHisLeuLeuGlnLeuThrValTrpGlylleLysGlnLeuGln 
ALaArgHeLeuALaValGluArgTyrLeuLysAspGlnGlnLeuLeuGlylleTrpGlyCysSerGly 
LysLeuIleCysThrThrAlavalProTrpAsnAlaSerTrpSerAsnLysSerLeuGluGlnlleTrp 
AsnHisThrThrTrpHETGluTrpAspArgGluIleAsnAsnTyrThrSerLeulleHisSerLeulLe 
GluGluSerGlriAsnGlnGlrKjluLysAsnGluGLnGluLeuLeuGluLeuAspLysTrpALaSerLeu 
TrpAsnTrpPheAsnlleThrAsnTrpLeuTrpTYrlleLysLeuPhelleHETIleValGLyGlyLeu 
ValGlyLeuArglleValPheAlaValLeuSerValValAsnArgValArgGlnGLyTyrSerProLeu 
SerPheGlnThrHisLeuProIleProArgGlyProAspArgProGLuGlylleGluGluGluGlyGly 
GLuArgAspArgAspArgSerlleArgLeuValAsnGlySerLeuAlaLeuIleTrpAspAspLeuArg 
SerLeuCysLeuPheSerTyrHisArgLeuArgAspLeuLeuLeuHeValThrArglleValGluLeu 
LeuGlyArgArgGlyTrpGluALaLeuLysTyrTrpTrpAsnLeuLeuGlnTyrTrpSerGlnGluLeu 
LysAsnSerAlaValSerLeuLeuAsnAlaThrALalleAlaValAlaGluGlyThrAspArgVallle 
GluValValGlnGluAlaTyrArgAUlleArgHisIleProArgArglleArgGLnGlyLeuGluArg 
IleLeuLeu 

3. An envelope protein of an AIDS virus according 
to claim 1 with the amino acid sequence: 



45 



50 



55 



16 



27 



0 199 301 



28 



Claims 



1. An envelope protein of an acquired immune 
deficiency syndrome (AIDS) virus comprising the 5 
amino acid sequence: 



METArg 

ValLysGluLysTyrGlnHisLeuTrpArgTrpGlyTrpArgTrpGlyThrMETLeuLeuGlyMETLeu 

METIleCysSerAlaThrGluLysLeuTrpValThrValTyrTyrGlyValProValTrpLysGluALa 

ThrThrThrLeuPheCysAlaSerAspAlaLysAlaTyrAspThrGluValHisAsnValTrpAlaThr 

HisAlaCysValProThrAspProAsnProGlnGluValValLeuValAsnValThrGluAsriPheAsn 

METTrpLysAsnAspMETValGluGlnMETHisGluAspIlelleSerLeuTrpAspGlnSerLeuLys 

ProCysValLysLeuThrProLeuCysValSerLeuLysCysThrAspLeuLysAsnAspThrAsnThr 

AsnSerSerSerGLyArgMETIleMETGluLysGlyGluIIeLysAsnCysSerPheAsnlleSerThr 

SerlleArgGlyLysValGlnLysGluTyrAlaPhePheTyrLysLeuAspIlelleProIleAspAsn 

AspThrThrSerTyrThrLeuThrSerCysAsnThrSerVallleThrGlnAlaCysProLysVaLSer 

PheGluProIleProIleHisTyrCysAlaProAlaGlyPheAlalleLeuLysCysAsnAsnLysThr 

PheAsnGlyThrGlyProCysThrAsnValSerThrValGlnCysThrHisGlylleArgProValVal 

SerThrGlnLeuLeuLeuAsnGlySerLeuAUGluGluGluValVallleArgSerValAsnPheThr 

AspAsnAlaLysThrllelleValGlnLeuAsnThrSerValGluIleAsnCysThrArgProAsnAsn 

AsnThrArgLysLysIleArglLeGlnArgGlyProGlyArgAlaPheValThrlleGlyLysileGly 

AsnMETArgGinAlaHisCysAsnlleSerArgAlaLysTrpAsnAlaThrLeuLysGlnlleAlaSer 

LysLeuArgGluGlnPheGlyAsnAsnLysThrllellePheLysGlnSerSerGlyGlyAspProGlu 

IleValThrHisSerPheAsnCysGlyGlyGLuPhePheTyrCysAsnSerThrGlnLeuPheAsnSer 

ThrTrpPheAsnSerThrTrpSerThrGluGlySerAsnAsnThrGluGlySerAspThrlleThrLeu 

ProCysArglleLysGlnPhelleAsnMETTrpGlnGluValGlyLysAlaMETTyrAlaProProIle 

SerGlyGlnlleArgCysSerSerAsnlleThrGlyLeuLeuLeuThrArgAspGlyGlyAsnAsnAsn 

AsnGlySerGLuIlePheArgProGlyGlyGlyAspMETArgAspAsnTrpArgSerGluLeuTyrLys 

TyrLysValValLysIleGluProLeuGLyValAlaProThrLysAlaLysArgArgValValGlnArg 

GLuLysArgAlaValGlylleGlyAlaLeuPheLeuGlyPheLeuGlyAlaAlaGlySerThrMETGly 

AlaAlaSerMETThrLeuThrValGlnAlaArgGlnLeuLeuSerGlylleValGlnGlnGLnAsnAsn 

LeuLeuArgAlalleGluAlaGlnGlnHisLeuLeuGlnLeuThrValTrpGlylleLysGlnLeuGln 

AlaArglleLeuAlaValGLuArgTyrLeuLysAspGlnGlnLeuLeuGlylleTrpGlyCysSerGly 

LysLeuIleCysThrThrAlaValProTrpAsnAlaSerTrpSerAsnLysSerLeuGluGlnlleTrp 

AsnHisThrThrTrpMETGluTrpAspArgGlulleAsnAsnTyrThrSerLeuIleHisSerLeuIle 

GluGluSerGlnAsnGlnGlnGluLysAsnGluGlnGluLeuLeuGluLeiiAspLysTrpAlaSerLeu 

TrpAsnTrpPheAsnlleThrAsnTrpLeuTrpTyrlleLysLeuPhelleMETIleValGlyGlyLeu 

ValGlyLeuArglleValPheAlaValLeuSerValValAsnArgVaUrgGLnGlyTyrSerProLeu 

SerPheGlnThrHisLeuProlXeProArgGlyProAspArgProGluGlylLeGluGluGLuGlyGly 

GluArgAspArgAspArgSerlleArgLeuValAsnGlySerLeuAlaLeuIleTrpAspAspLeuArg 

SerLeuCysLeuPheSerTyrHisArgLeuArgAspLeuLeuLeuIleValThrArglleValGluLeu 

LeuGLyArgArgGlyTrpGluAlaLeuLysTyrTrpTrpAsnLeuLeuGlnTyrTrpSerGlnGluLeu 

LysAsnSerAlaValSerLeuLeuAsnAlaThrAlalleAlaValAlaGluGlyThrAspArgVaLlle 

GluValValGlnGluAlaTyrArgAlalleArgHislleProArgArglleArgGlnGlyLeuGIuArg 
TleLeuLeu 



or fragments thereof. 

50 

2. An envelope protein of an AIDS virus according 
to claim 1 with the amino acid sequence: 
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The size and shape of epitopes found in carbohy- 
drate antigens have been extensively studied, but 
less is known about the structure of epitopes from 
protein molecules. Some epitopes of protein anti- 
gens have been defined at the level of their tertiary 
structure. In every instance, the epitopes were 
formed not by the primary sequences alone, but by 
the juxtaposition of residues brought together by 
the folding of the polypeptide chain(s) of the native 
molecule. In addition, the structure of the 68Kd env 
protein of the instant invention makes it particularly 
well suited for use as a vaccine. The 68Kd env 
protein comprises a large portion of the gene prod- 
uct which (a) was shown to be reactive with all the 
AIDS sera tested; and (b) has the unique structural 
feature of containing both an extracellular hydro- 
philic region and the transmembrane hydrophobic 
regions. The latter structural feature makes it well 
suited for use as a vaccine using liposome encap- 
sulation to create a vehicle for administration. 

Routes of administration, antigen dose, number 
and frequency of injections are all matters of op- 
timization within the scope of ordinary skill in the 
art. particularly in view of the fact that there is 
experience in the art in providing protective immu- 
nity by the injection of other related antigens to 
provide immunity in other viral infections. It is an- 
ticipated that the principal value of providing immu- 
nity to AIDS infection will be for those individuals 
who have had no previous exposure to AIDS,. e.g., 
individuals who are in the high risk population, such 
as homosexuals, drug addicts and people from 
Haiti and Central America and individuals who may 
be receiving blood transfusions. It is also antici- 
pated that temporary immunity for infants may be 
provided by immunization of mother* during preg- 
nancy. "—^ 

Example 7 



Diagnostic Test for AIDS 

It is clear that the env gene proteins of the 
instant invention may be used as diagnostic 
reagents for the detection of AIDS-associated anti- 
bodies. It is also apparent to one of ordinary skill 
that a diagnostic assay for AIDS using polyclonal or 
monoclonal antibodies to the AIDS env protein of 
the instant invention may be used to detect the 
presence of the AIDS virus in human blood. In one 
embodiment a competition immunoassay is used 
where the antigenic substance, in this case the 
AIDS virus, in a blood sample competes with a 
known quantity of labelled antigen, in this case 
labelled AIDS env protein, for a limited quantity of 
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antibody binding sites. Thus, the amount of label- 
led antigen bound to the antibody is inversely 
proportionaJ to the amount of antigen in the sam- 
ple. In another embodiment, an immunometric as- 
say may be used wherein a labelled AJDS-env 
antibody is used. In such an assay, the amount of 
labelled antibody which complexes with the 
antigen-bound antibody is directly proportionaJ to 
the amount of antigen (AIDS virus) in the blood 
sample. In a simple yes/no assay to determine 
whether the AIDS virus is present in blood, the 
solid support is tested to detect the presence of 
labelled antibody. In another embodiment mon- 
oclonal antibodies to AIDS env protein may be 
used in an immunometric assay. Such monoclonal 
antibodies may be obtained Joy methods well 
known in the art, particularly the" process of Milstein 
and Kohler reported in Nature 25§j495-497 (1975). 

The immunometric assay method is as follows: 
Duplicate samples are rur. in which 100 ul of a 
suspension of antibody immobilized on agarose 
particles is mixed with 100 ul of serum and 100 til 
of soluble ^-labelled antibody. Tnis mixture is for 
specified times ranging from one quarter hour to 
twenty four hours. Following the incubation periods 
the agarose particles are washed by addition of 
buffer and then centrifuged. After removal of the 
washing liquid by aspiration, the resufting pellet of 
agarose particles is then counted for bound m l- 
labelled antibody. The counts obtained for each of 
the complexes can then be compared to controls. 

While the invention has been described in 
terms of certain preferred embodiments, modifica- 
tions obvious to one with ordinary skill in the art 
may be made without departing from the scope of 
the invention. Fo**xample, it is understood that the 
env AIDS DNAs- iftficribed herein represent only 
the precise stnjitftGire* of two naturally occurring 
gene segments, ft is expected that slightly modified 
alleles will be found encoding for similarly function- 
ing proteins, and such gene segments and proteins 
are considered to be equivalents for the purpose of 
this invention. It is also suspected that other vari- 
ants in addition to those described herein will be 
found and that the envelope protein of said variants 
will differ slightly. These variant envelope proteins 
are likewise considered within the scope of the 
invention. DNA having equivalent codons is consid- 
ered within the scope of the invention, as are 
synthetic gene segments that encode homologous 
proteins of the virai envelope. 

Various features of the invention are set forth in 
the following claims. 
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form. It is also foreseeable that monoclonal anti- 
body affinity chromatography columns utilizing env 
AIDS polyclonal or monoclonal antibodies, could be 
used as an alternative to HPLC. 

Example 5 



Polyacryiamide gel electrophoresis and Western 

blot analysis 7 o 

Cells were lysed by resuspending the cell pel- 
lets (approximately 10' cells) in TG buffer (10 mM 
Tris, pH 7.4, 10% glycerol), mixed with an equal 
volume of 2 x sample buffer [Laemmli, U.K., rs 
"Cleavage of Structural Proteins During the Assem- 
bly of the Head of Bacteriophage T4", Nature 227, 
680-685 (1970)] and incubated at 95 °C for five (5) 
minutes. Cell debris were pelleted by centrifugation 
and the cleared lysates were subjected to SDS- 20 
PAGE analysis [Laemmli, U.K., supra]. For Western 
blot analysis, the proteins from the acrylamide gel 
were electroblotted onto a 0.1 urn nitrocellulose 
membrane (Schleicher and Schuel!) for 16 hr at 
50V, in 12.5 mM Tris. 96 mM glycine, 20% metha- 2s 
nol, 0.01% SDS at pH 7.5. Processing of the blot 
was carried out using the methods described by 
Towbin, H. et al. ["Electrophoretic Transfer of Pro- 
teins From Polyacrylamide Gels to Nitrocellulose 
Sheets: Procedure and Some Applications", Proc. 30 
Natl. Acad. Sci. U.S.A., 76, 4350-4354, (1979)]. For 
treatment with the human sera, the blots were 
incubated with a 1000 fold dilution of the sera in 
antibody buffer (20 mM sodium phosphate buffer, 
pH 7.5. containing 0.5 M NaCl, 1% BSA and 0.05% 35 
Tween 20) for 2-6 hr. The blots were then washed 
twice with phosphate buffered saline containing 
0.05% Tween 20 and then incubated with 125-1- 
labelled Staphylococcus aureus protein A for an 
additional period of 1 hr. The blot was then washed 40 
twice in PBS-Tween 20 buffer, dried and auto- 
radiographed. 

Example 6 



Immunization with Env Protein of AIDS Virus 

It is clear that in spite of the divergence ob- 
served between HTLVIH and ARV-2 sequences, the 50 
recombinant proteins synthesized by the bacteria 
react well with AIDS patients' sera derived from 
both geographical locations of the United States. 
One hundred percent (100%) of the AIDS patients' 
sera tested showed high reactivity (50 individual 55 
samples, 25 from the east coast of the United 



States and 25 from the west coast of the United 
States). Thus all the env proteins contain at least 
one conserved epitope. All of the human sera from 
AIDS patients tested contained antibodies to the 
env proteins of the instant invention. This strongly 
suggests that these env proteins with the con- 
served epitopes would be immunogenic in man. 

It will be readily appreciated that the env pro- 
teins of the instant invention can be incorporated 
into vaccines capable of inducing protective immu- 
nity against the AIDS virus. By methods known in 
the art, the specific amino acids conprismg the 
epitopes of" the env protein may be determined. 
Peptides may then be synthesized, comprising an 
amino acid sequence corresponding to an epitope 
of an env AIDS protein either in monomenc or 
multimeric form. These synthetic peptides may 
then be incorporated into vaccines capable of in- 
ducing protective immunity against AIDS virus. 
Techniques for enhancing the antigenicity of such 
peptides include incorporation into a multimeric 
structure, binding to a highly immunogenic protein 
carrier, for example, keyhold limpet hemocyanin. or 
diphtheria toxoid, and administration in combination 
with adjuvants or any other enhancers of immune 
response. In addition, the vaccine composition may 
comprise antigens to provide immunity against oth- 
er diseases in addition to AIDS. 

An amino acid sequence corresponding to an 
epitope of an env protein either in monomeric or 
multimeric form - (peptide) may be obtained by 
chemical synthetic means or by purification from 
biological sources including genetically modified 
microorganisms or their culture media. The peptide 
may be combined in an amino acid sequence with 
other peptides including fragments of other pro- 
teins, as for example, when synthesized as a fusion 
protein, or linked to other antigenic or non-antigenic 
peptides of synthetic or biological origin. The term 
"corresponding to an epitope of a env protein" will 
be understood to include the practical possibility 
that, in same instances, amino acid sequence vari- 
ations of a naturally occurring peptide may be 
antigenic and confer protective immunity against 
AIDS infection. Possible sequence variations in- 
clude, without limitation, amino acid substitutions, 
extensions, deletions, interpolations and combina- 
tions thereof. Such variations fall within the con- 
templated scope of the invention provided the pep- 
tide containing them is antigenic and antibodies 
elicited by such peptide cross-react with naturally 
occurring env protein or non-variant repeated pep- 
tides of env protein, to an extent sufficient to Di- 
vide protective immunity when administered as a 
vaccine. Such vaccine compositions wtll ce com- 
bined with a physiologically acceptable medium. 
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gested with Stul. Again, upon recircularization and 
blunt-end ligation, the EcoRI site was regenerated; 
however, an additional 483 bp of env coding se- 
quences were removed. 

Example 3 



Bacterial Growth and induction of env Gene Ex- 
pression 

Cultures of E. coli strain MC 1061 transformed 
with plasmid pRK248clts and the pEV1, -2. or - 
3/env plasmids were grown in M9 medium contain- 
ing 0.5% glucose and 0.5% casamino acids at 
30 °C to mid-log phase and then induced by shift- 
ing to 42°C for 2 hr. The cells were collected by 
centri fugation and processed as described in Ex- 
amples 4 and 5. 

Example 4 



Expression and Purification of Env AIDS 

A homogeneous recombinant viral env AIDS 
was purified according to the following procedure." 
The env AIDS protein expressed by a microbe 
tends to associate with the membrane fractions of 
the host microbe, principally the inner membrane 
of the microbe. The following purification method 
was designed to deal with this finding. 

This purification method comprises: 

(a) lysis of transformed microbial cells pro- 
ducing recombinant env AIDS protein; 

(b) separation of env AIDS associated cel- 
lular membranes from other cellular compo- 
nents; 

(c) extraction of env AIDS from associated 
membranes; and 

(d) chromatographic purification of the resul- 
tant extraction solution containing env AIDS 
to yield a substantially pure recombinant 
viral env protein. 

More specifically, the preferred purification 
method for the preparation of substantially pure 
recombinant viral env protein comprises: 

(a) cultivating a transformed organism con- 
taining a DNA sequence which codes for 
viral env protein; 



(b) causing a culture of me transformed or- 
ganism of step (a) to accumulate the env 
protein; 

5 (c) lysing the culture of transformed organ- 

isms of step (b) to form a cell lysate mixture; 

(d) isolating the ceil membrane components 
of the cell lysate mixture of step (c); 

w 

(e) washing the isolated cell membrane com- 
ponents with an extraction solution to yield a 
wash solution containing env protein; and 

75 (0 chromatographically punfying the wash 

solution of step (e) t- Yield a substantially 
pure env AIDS proteir 
In carrying out this mer-cd it is preferred that 
the cells be lysed by son; ;jtion, aJthough it is 
20 forseeabie tnat other known -ethods such as en- 
zyme or mechanical lysis co- ; aJso be used. It is 
preferred that the cell membrane component spe- 
cifically the inner and outer membranes, be iso- 
lated from other cellular components by methods 
25 such as centrifugation. It has been found that env 
AIDS expressed by the transformed microorganism 
tends to become associated with the cellular mem- 
• branes. Therefore, isolation of these membranes 
during the purification process ensures high pu- 
30 rification levels and high purity env AIDS at the end 
of the purification procedure. 

Once the cell membranes are isolated from the 
lysate mixture, they are washed with an extraction 
solutior r-eferably salt solutions and a detergent 
35 to yielc » ?:^cond solution containing approximately 
50% &-v a. OS protein. Preferably the cell mem- 
branes are cashed in four separate steps wfth the 
salt solutions and detergent aJthough it is forseea- 
ble that certain of these steps could be combined, 
40 rearranged or eliminated. The first step of washing 
the cell membrane may be done with a salt solu- 
tion, preferably 1M NaCl. In the second step the 
eel! membrane is washed with a detergent solution, 
preferably 1% Triton X-100. In the third step, the 
45 cell membrane is washed with another salt solution, 
1.75M to 3.5M guanidine HCI. The final wash is 
also with a salt solution preferably about 7M 
Guanidine HCI. The wash solution which results 
from the fourth and final wash comprises about 
50 50% env AIDS. 

The final 50% env AIDS wash solution is then 
further purified by a chromatography step, prefer- 
ably reverse phase high perform arce liquid 
chromatography (HPLC). The HPLC sieo ytelds 
55 env AIDS protein in' a substantially 100% pure 
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lowed by ethanol precipitation. Approximately 1 ug 
of DNA from each reaction was mixed with 90 
pmoles of phosphorylated Bglll linkers 
(CAGATCTG, purchased from Collaborative Re- 
search) and incubated with T4 DNA ligase at 15°C 5 
for 18 hours. The ligation products were then di- 
gested with Bglll and Pstl and subjected to gel 
electrophoresis in 1% agarose. The 3600 bp and 
760 bp fragments from both reactions were recov- 
ered from the gel. For the construction of pRC2. jo 
the 3600 bp from the Klenow reaction was ligated 
to the 760 bp fragment from the S1 reaction. To 
construct a plasmid with the Bglll site on the other 
side of EcoRI (tet R side), designated pRCI. the 
3600 bp fragment from the S1 reaction was ligated 75 
to the 760 bp fragment from the Klenow reaction. 
E. coli strain RRI (ATCC No. 31343) was trans- 
formed with the ligation mixtures, and transfor- 
mants were selected on LB agar plates containing 
50 ug/ml ampicillin. Transform ants containing the 20 
expected plasmid constructions were identified by 
restriction analysis of the isolated plasmid DNA. 
DNA sequence analysis confirmed that the S1 
nuclease treatment precisely removed the 5* 
single-stranded termini. 25 

pRC23 (see Figure 7) was constructed by in- 
serting into pRC2 a 250 bp Bglll-Haelll fragment 
containing the X P L promoter joined to a pair of 
complementary synthetic oligonucleotides compris- 
ing a model ribosome-binding site (RBS). The 30 
Haelll site is located within the 5' non-coding re- 
gion of the XN gene 115 bp downstream of the 
P L transcriptional initiation site. Approximately 1 ag 
of a 450 bp Bglll-Hpal fragment isolated from 
phage x DNA was digested with Haelll. 200 ng of 35 
the resulting digestion products were mixed with 60 
pmoles each of phosphorylated synthetic 
oligonucleotides containing the model RBS. The 
ligated molecules were digested with Bglll and 
EcoRI and separated on a 5% polyacrylamide gel. 40 
The 270 bp ligation product was recovered from 
the gel, mixed with gel purified pRC2 vector that 
had been digested with Bglll and EcoRI, and in- 
cubated with T4 DNA ligase at 15°C for 15 hours. 
The ligation mixture was used to transform strain 45 
RR1(pRK248Clts). Transformants selected on 
ampicillin-containing medium were screened by re- 
striction analysis of the isolated plasmid DNA. The 
expected plasmid construction, pRC23, was con- 
firmed by further restriction enzyme digestions and 50 
by DNA sequence analysis across the EcoRI junc- 
tion (Fig. 7). 

For the construction of the pEV-vrf set of plas- 
mids (see Figure 8), plasmid pRC23 was digested 
with EcoRI and Hindlll and the pRC23/EcoRI-Hindlll 55 
vector isolated by preparative agarose gel elec- 



trophoresis. The mixture of synthetic 
oligonucleotides (32, 33, and 34 nucleotides) was 
combined with the mixture of the complementary 
sequences, heated to 58°C for 5 minutes in 150 
mM NaCI, and cooled slowly to allow annealing. 0.1 
pmoles of the synthetic duplexes were added to 
0.07 pmoles of the pRC23/EcoRI-Hindlll vector and 
incubated with T4 DNA ligase at 15° C for 15 hours. 
Strain RR1 (X cl857) was transformed with the 
ligation products. Six ampicillin resistant transfor- 
mants were selected for DNA sequence analysis. 
Of the six, two contained the expected sequence 
for pEV-vrfl, one for pEV-vrf2, and three for pEV- 
vrf3 (Fig. 3). 

For the expression of the AIDS env gene, one 
ug of a 2400 bp EcoRI -Hindlll DNA fragment, 
which was isolated from the cloned HTLV-III pro- 
virai genome by preparative agarose gel elec- 
trophoresis, was mixed with 0.1 ug of EcoRI - 
Hindlll digested vector DNA (pEV-vrfi, -2, or -3). 
After heating at 65 °C for 3 minutes, the mixtures 
were chilled on ice, and 20 ul ligation reactions 
were assembled, containing 50 mM Tris-HCI (pH 
7.4), 10 mM MgCI,. 10 mM DTT, 0.3 mM ATP, and 
200 units of T, DNA ligase. After incubation at 
15°C for 4 hours, the reactions were terminated by 
heating at 65 *C for 5 minutes. The ligation pro- 
ducts were used to transform E. coli strain MC1061 
containing plasmid pRK248clts. Transformants 
were selected on Luria broth agar containing 50 
ug/ml ampicillin at 30°C for 18 hours. Plasmid 
DNA was isolated from 1 ml of each culture and 
subjected to restriction analysis. All 12 isolates 
contained the expected plasmid construction. 
These intermediate constructions were then used 
to make pEV1, -2, and -3/env 44-640 by deleting 
the 600 bp between the EcoRI and Kpnl sites as 
described below. 

Approximately 0.5 ug of plasmid DNA was 
digested with Kpnl and EcoRI. The resulting termini 
were then treated with the Klenow fragment of DNA 
polymerase I in the presence of ail four deox- 
yribonucleotides (at 100 uM) at 37° C for 30 min- 
utes. This step results in the "filling-in" of the 5' 
overhang of the EcoRI terminus and the removal of 
the 3'overhang of the Kpnf terminus. Upon recir- 
culization of the linear plasmid and blunt-end liga- 
tion of these termini, an EcoRI site is regenerated. 
Transformants containing plasmids with the expect- 
ed deletion were identified by restriction analysis. 

A second set of deletion derivatives, desig- 
nated pEV/env 205-640 was constructed in a simi- 
lar fashion. A portion of the linear piasmid tnat had 
been digested with EcoRI and Kpnl and treated 
with Klenow, as described above, was funner rji- 



1 1 



17 



0 199 301 



18 



tion. No reaction was observed with sera from 
healthy individuals or from HTLV-I infected in- 
dividuals. The patient sera were derived from all 
parts of the United States including California and 
all AIDS patients' sera tested so far were found to 
be positive. The results suggest that these anti- 
bodies are mainly directed against the protein 
back-bone of the molecule. 

It appears, therefore, that the env gene pro- 
ducts constitute the best diagnostic reagents for 
the detection of AIDS associated antibodies. The 
env gene product of the instant invention encom- 
passes a large portion of the protein molecule and 
contains both the conserved and divergent portions 
of the molecule. In spite of the divergence ob- 
served between HTLVIll and ARV-2 sequences the 
recombinant env proteins of the instant invention 
synthesized by the bacteria react with AIDS patient 
sera derived from both geographical locations of 
the United States. One hundred percent (100%) of 
AIDS patient sera (50 individual samples, 25 de- 
rived from the East Coast of the United States and 
25 derived from California) tested showed high 
reactivity. This is strong evidence for the presence 
of conserved epitopes within the molecule against 
which the immune system could mount an anti- 
body reaction. The human immune system may 
thus be mounting an immune response against 
conserved epitopes of the envelope molecule, as 
suggested by the reactivity of the AIDS patient 
sera. The observed divergence between various 
isolates of HTLV-IM thus may not pose a problem 
for the use of recombinant protein as a vaccine. 
The 68Kd protein is ideally suited for such a pur- 
pose since it encompasses a large portion of the 
gene product and has the unique structural feature 
of containing both the extracellular hydrophilic re- 
gion and the membrane associated hydrophobic 
regions. This structural feature makes it welf suited 
for encapsulation into liposomes which have been 
used as vehicles for vaccination against other vital 
envelope proteins. 

Based on these discoveries it is proposed that 
in the practice of screening blood for AIDS only 
AIDS envelope protein or a variant of said protein 
be utilized. Utilizing the env AIDS protein of the 
instant invention, human blood can be screened for 
the presence of antibodies to the AIDS virus. This 
and other techniques are readily determined, once, 
as taught for the first time by the present invention, 
the envelope AIDS protein has been recognized to 
be the envelope protein of the etiologic agent of 
AIDS. The foregoing and other objects, features 
and advantages of the invention will be apparent 
from the Allowing examples of preferred embodi- 
ments c: "'e invention. 



Example 1 



Molecular cloning and nucleotide sequence anaiy- 
5 sis of the HTLV-lll proviraJ genome. 

The integrated proviral genome of HTLV-lll was 
recently cloned from the genomic DNA of H9 cells 
infected with HTLV-lll [Shaw, G.M. et ai., supra]. 

jo The proviraJ genome which was obtained by using 
Xbal digested H9/HTLV-IN DNA contained two in- 
ternal EcoRI sites within the viraJ genome and two 
additional sites in the cloning vector X J1. These 
sites were used for further subcloning of the three 

75 DNA fragments of 5.5Kb, 4.5Kb and 1.1Kb into 
PBR322 (ATCC No. 37017). Nucleotide sequence 
analysis of the proviraJ genome was determined by 
the chemical degradation method of Maxam, A.M. 
and Gilbert. W„ "Sequencing end-labelled DNA 

20 with base-specific chemical cleavages", Meth. En- 
zymol. 65, 499-560 (1980). For the sequence ana- 
lysis, DNA inserts from the three subclones were 
isolated by eiectroelution and further cleaved with 
appropriate restriction enzymes. The DNA frag- 

25 ments were labelled at their 5'ends with -r32P-ATP 
using polynucleotide kinase, or at their 3* ends with 
a-32P-NTP by filling in with DNA polymerase I - 
(Klenow fragment). The DNA fragments labelled at 
the two ends were cleaved with a second enzyme 

30 and the fragments labelled at a single end were 
purified on 5% acrylamide gels and used for se- 
quence analysis. For the sequence anaiysis of the 
env gene, a shotgun approacn was utilized where 
the 4.5 EcoRI fragment was cieaved with one of the 

35 following enzymes: Sglll. Hirdlli. Xhol. Avail. Hinfl 
and Sau3A and the restricticn fragments labelled 
and sequenced as described above. The 
nucleotide sequence of the envelope gene used m 
the present invention is shown in Figure 1. 

40 

Example 2 



Construction of pEV/env 44-640 

45 

pRC2 is a derivative of pBR322 containing a 
unique Bgl II site adjacent (on the amp R side) to 
the EcoRI site in the plasmid. This plasmid was 
constructed in the following manner. 20 ug of 

so pBR322 plasmid DNA were digested with EcoRI 
and then split into two reactions. In one, the pro- 
truding 5' single-stranded termini were removed 
with S1 nuclease; in the other reaction, the termini 
were filled-in by inorporating deoxynucleotides with 

55 the Klenow fragment of DNA polymerase i. Both 
reactions were terminated by phenol extraction fok 
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the desired protein by immunological or biological 
assays. Two methods are available to screen using 
immunological assay: screening of bacterial colo- 
nies for the presence of protein using antibody; 
and, preferably, the bacterial lysates are elec- 
trophoresed, blotted onto a nitrocellulose paper and 
then probed with the antibody. 

In a preferred embodiment of this invention, 
cultures of the E. coli strain MC 1061 transformed 
with pRK248clts and the pEV 1, 2, or 3/env 44-640 
(or pEV 1, 2 or 3/env 205-640) were grown in M9 
medium at 30 °C to mid-log phase and then in- 
duced by shifting to 42 °C for 2 hr. Samples of the 
bacterial cultures were then taken and subjected to 
SDS-polyacrylamide gel electrophoresis, followed 
by Western blot analysis to detect env proteins. 
The protein blots were treated with antisera to env 
AIDS proteins isolated either from immunized rab- 
bits or from AIDS patients previously shown to 
contain high titer antibodies to AIDS antigens. This 
was followed by incubation with ,25 l-labelled 
Staphylococcus aureus protein A, washing and 
autoradiography. Similar results were obtained with 
both sera except that the human serum was found 
to contain much higher titers of anti-HTLV-l!l anti- 
bodies and was devoid of all background reactivity 
with the E. coli proteins. For this reason human 
antibodies were used in all subsequent character- 
ization. 

Figure 4 shows the pattern of reactivity of the 
env AIDS proteins synthesized in bacteria 
(recombinant proteins) with anti-HTLV-lll anti- 
bodies. The open reading frame in pEV3/env 44- 
640 encodes a protein that should migrate as a 68 
Kd band on the gel. In fact, a 68 Kd band is 
observed in the lane corresponding to the induced 
cells containing pEV3/env 44-640 (lane C). How- 
ever, in addition to the 68 Kd band, these cells 
synthesized proteins of 35 Kd, 25 Kd and 17 Kd 
which specifically cross-reacted with anti-HTLV-lll 
antibodies. No HTLV-III cross-reacting bands are 
evident in the uninduced control (Lane b) or in a 
second negative control sample (Lane a) of in- 
duced cells containing a plasmid that directs the 
synthesis of v-bas p2l oncogene product (Lacal, 
J.C. et al., supra). The appearance of multiple 
bands synthesized from the env gene sequences 
was an unexpected result. Another unexpected re- 
sult was the synthesis of env gene products from 
the plasmid (pEVI/env 44-640) where the insert 
was placed in the wrong reading frame with re- 
spect to the initiator codon immediately down 
stream of the P L promoter (Lane d). In this case, E. 
coii cells containing plasmid pEVl/env. 44-640 syn- 
thesized a 63 Kd protein in addition to the 35 Kd, 
25 Kd and 17 Kd proteins. These results could be 



readily explained when the nucleotide sequence of 
the envelope gene (Fig. 1) was examined. About 
155 bases downstream to the Kpnl site is an ATG 
codon which appeared to be utilized for the synthe- 

5 sis of the env gene product by the expression 
plasmid pEV1/env 44-640. Internal translation initi- 
ation is also the likely explanation for the appear- 
ance of the 35Kd, 25Kd and 17Kd proteins. Initi- 
ation codons which are preceded by so-called 

io Shine-Dalgarno sequences (AGGA) are found with- 
in the env coding region at locations that are con- 
sistent with the sites of the observed protein pro- 
ducts. 

To confirm the above interpretation and to rule 

75 out the possibility that the smaller proteins are not 
formed as a result of premature termination or from 
proteolytic cleavage of the larger product, another 
deletion mutant in which sequences between the 
Kpnl and Stul sites were deleted were constructed. 

20 This expression plasmid contains the coding se- 
quences from amino acid positions 205-640 which 
could code for a protein of 49 Kd. Analysis of the 
proteins induced from E. coli harboring this plas- 
mid verified that, in fact, these cells synthesize a 

25 49 Kd protein in addition to the 35 Kd, 25 Kd and 
17 Kd proteins (lane e, Fig. 4). From these results, 
it was concluded that pEV3/env 44-640 expression 
piasmid directs the synthesis of a 68 Kd protein in 
addition to several additional smaller polypeptides - 

30 (i.e., 35Kd, 25Kd and l7Kd) produced from all of 
the env expression plasmids resulting from internal 
translation initiation within the env gene. 

Screening of AIDS SERA 

35 

Because anti-HTLV-lll antibodies are found in 
more than 90% of the AIDS patients, it was of 
interest to see if the bacteriaily synthesized env 
gene products could be used as diagnostic tools 
40 for the detection of these antibodies. For this analy- 
sis, total cell protein from an induced bacterial 
culture was fractionated by SDS-PAGE and trans- 
ferred to a nitrocellulose filter by Western blotting 
technique. Strips of the filter containing transferred 
45 proteins were reacted with 1000-fold diluted human 
sera, and the antigen-antibody complexes formed 
were detected by incubation of the strips with 1 25- 
Habelled Staphylococus aureus protein A followed 
by autoradiography. Prominent bands correspond- 
so ing to reaction of the antibody to the 68 Kd, 35 Kd, 
25 Kd and 17 Kd proteins were consistently ob- 
served when the serum used was from savaris 
with AIDS syndrome. The results of sucn asu/s 
with different human sera are presented -n ^gur« 
55 5. The negative controls used were norm* **^r**n 
sera and serum from a patient with HTi»-< ■-♦<*;. 
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tion of an appropriate host is also controlled by a 
number of factors recognized by the' art. These 
include, for example, compatibility with the chosen 
vector, toxicity of proteins encoded by the hybrid 
plasmid, ease of recovery of the desired protein, s 
expression characteristics, biosafety and costs. A 
balance of these factors must be struck with the 
understanding that not all hosts may be equaJly 
effective for expression of a particular recombinant 
DNA molecule. 10 

A preferred embodiment of the instant inven- 
tion ts to express segments of the AIDS env protein 
in E. coli by inserting restriction fragments isolated 
from the cloned proviral genome into the versatile 
pEV-vrf (variable reading frame) expression plas- 75 
mids (for detaifs of construction see Example 2). 
Triese versatile pEV-vrf plasmids are derivatives of 
pBR322 which contain the phage lambda P L pro- 
moter, a synthetically-derived ribosome-binding 
site, and convenient cloning sites (EcoR!, BamHI, 20 
Clal and Hindlll) just downstream to the initiation 
codon (Figure 8). A set of three plasmids was 
constructed to accomodate all three translation^ 
reading frames. The P L promotor is regulated by a 
temperature-sensitive cl repressor encoded on the 25 
compatible plasmid pRK248clts [ATCC 33766; Ber- 
nard, H.U. and Helinski, D.R., "The use of the \ 
phage promotor P L to promote gene expression in 
hybrid plasmid cloning vehicles", Meth. Enzymol. 
68, 482-492 (1979)]. These expression plasmids 30 
have been used to produce substantial amounts of 
several heterologous proteins in E. coii including v- 
bas p21 [Lacal, J.C. et al., "Expression of Normal 
and Transforming H-ras genes in E. coli and pu- 
rification of their encoded p2l proteins", PNAS 81. 35 
5305-5309 (1984)] and -murine interieukin-1 
[Lomedico, P.T. et al., "Cloning and Expression of 
Murine lnterleukin-1 cDNA in E. coli". Nature 312, 
453-462 (1984)]. 

In the present synthesis the preferred initial 40 
cloning vehicle is the bacterial plasmid pBR322 - 
(ATCC 37017) and the preferred initial restriction 
endonuclease sites therein are the EcorR! and Hin- 
dlll sites {Figure 3). Insertion of proviral DNA con- 
tained within the genome of H9 cells into these 45 
sites provides a large number of bacterial clones 
each of which contains one of the proviral DNA 
genes or fragments thereof present in the genome 
of H9 cells. Only a very few of these clones will 
contain the gene for env AIDS or fragments there- so 
of. 



The preferred host for initial cloning and ex- 
pression of the env AIDS gene in accordance with 
this invention is E. coli MC 1061 [Casadaban, M.J. 
and Cohen, S.M., "Analysis of Gene Control Sig- 
nals by DNA Fusion and Cloning in E. coli", J. Mol. 
Biol., 135, 179-207 (1980)]. 

The coding sequences for amino acid residues 
#44 to 640 of the env protein are located down- 
stream of the P L promoter between the Kpnl and 
Hindlll sites on the restriction map as shown in 
Figure 3. Aside from the location of these conve- 
nient restriction sites, these sequences were cho- 
sen for bacterial expression experiments because 
they did not include the amino-terminal signal pep- 
tide as well as the hydrophobic transmembrane 
segment at the car boxy I end. These sequences 
were excluded to avoid possible toxicity problems 
which can occur when hydrophobic proteins are 
over-produced in bacterial cells. In a preferred em- 
bodiment of this invention an expression plasmid 
was constructed that would direct the synthesis of 
this segment of the env gene product (designated 
pEV/env 44-640), an intermediate construction was 
first made by inserting a 2400 bp EcoRI-Hindlll 
fragment between the EcoRI and Hindlll sites in the 
pEV-vrf plasmids. The HTLV-III sequences (600 
bp) between the EcoRI and the Kpnl site were then 
removed from the intermediate construction as 
shown in Figure 3. These plasmid constructions 
were carried out with ail three pEV-vrf plasmids so. 
that subsequent deletions could be made and the 
correct reading frame maintained. In addition, the 
constructs ns made in the incorrect reading frames 
servec as :rnportant controls in the expression ex- 
periment described below. 

in aether embodiment of this invention, a sec- 
ond se-. of expression plasmids were constructed in 
a similar fashion by deleting sequences between 
EcoRI and Stul sites which occur 483 bp down- 
stream of the env gene. Again these deletions - 
(designated pEV/env 205-640) were made in all 
three reading frames. The translation termination 
codon used in all of the env expression plasmids is 
presumably an in-frame TAA located 23 bp down- 
stream of the Hindlll site in the plasmid. Thus, 8 
amino add residues at the carboxyl terminus are 
encoded by pBR322 sequences contained within 
the pEV-vrf expression plasmids. 

Expression of ENV AIDS 

There are several approach** to *cr©en for 
bactenal clones containing env AiCS :0^4A. These 
include, for example, RNA selection ^yondi ration, 
differential hybridization, hybndiza&oo a syn- 
thetic probe and screening for cion#* -**t produce 
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divergence was observed between the HTLV-III - 
(HXB-3) and LAV sequence. Among the HTLV se- 
quences, the divergence was about 1.6%. How- 
ever, approximately 17% divergence was observed 
between HTLV-flf and ARV-2 and this was more 
pronounced in the extracellular region of the en- 
velope gene product {Figure 2). This high rate of 
divergence could be due to the geographical loca- 
tion from where the two isolates were derived or 
the time of isolation of these variants. ARV-2 was 
isolated from the west coast of the United States 
more recently. The HTLV-lfl isolates for which the 
nucleotide sequences have been determined were 
all obtained from the east coast of the United 
States a year earlier. LAV was obtained from a 
French patient who appears to have acquired the 
virus in New York about the same period. The 
observed differences in the sequence probably re- 
flect divergent evolution of strains separated in 
time or geography or both. Within the env region, 
the highest level of divergence is in the extracel- 
lular portion of the protein. 

Expression Vector 

A wide variety of host/cloning vehicle combina- 
tions may be employed in cloning the double- 
stranded DNA. For example, useful cloning ve- 
hicles may consist of segments of chromosomal, 
nonchromosomal and synthetic DNA sequences, 
such as various known bacterial plasmids. e.g. 
plasmids from E. coli such as pBR322. phage 
DNA, and vectors derived from combinations of 
plasmids and phage DNAs such as plasmids which 
have been modified to employ phage DNA or other 
expression control sequences or yeast plasmids. 
Useful hosts may include microorganisms, mam- 
malian cells, plant cells and the like. Among them 
microorganisms and mammalian cells are prefer- 
ably employed. As preferable microorganisms, 
there may be mentioned yeast and bacteria such 
as Escherichia coli, Bacillus subtilis. Bacillus 
stearothermophilus and Actinomyces. The above- 
mentioned vectors and hosts may also be em- 
ployed for the production of a protein from a gene 
obtained biologically as in the instant invention. Of 
course, not all host/vector combinations may be 
equally efficient. The particular selection of 
host/cloning vehicle combination may be made by 
those of skill in the art after due consideration of 
the principles set forth without departing from the 
scope of this invention. 

Furthermore, within each specific cloning ve- 
hicle, various sites may be selected for insertion of 
the double-stranded DNA. These sites are usually 
designated by the restriction endonuclease which 



cuts them. For example, in pBR322 the EcoRI site 
ts located just outside the gene coding for ampicil- 
lin resistance. Various sites have been employed 
by others in their recombinant synthetic schemes. 

5 Several sites are well recognized by those of skill 
in the art. It is, of course, to be understood that a 
cloning vehicle useful in this invention need not 
have a restriction endonuclease site for insertion of 
the chosen DNA fragment. Instead, the vehicle 

70 could be joined to the fragment by alternative 
means. 

The vector or cloning vehicle and in particular 
the site chosen therein for attachment of a selected 
DNA fragment to form a recombinant DNA mol- 
15 ecule is determined by a variety of factors, e.g., 
number of sites susceptible to a particular restric- 
tion enzyme, size of the protein to be expressed, 
susceptibility of the desired protein to proteolytic 
degradation by host cell enzymes, contamination of 

20 the protein to be expressed by host cell proteins 
difficult to remove during purification, expression 
characteristics, such as the location of start and 
stop codons relative to the vector sequences, and 
other factors recognized by those of skill in the art. 

25 The choice of a vector and an insertion site for a 
particular gene is determined by a balance of these 
factors, not air selections being equally effective for 
a given case. 

There are several known methods of inserting 

30 DNA sequences into cloning vehicles to form re- 
combinant DNA molecules which are equally useful 
in this invention. These include, for example, direct 
ligation, synthetic linkers, exonuclease and 
polymerase-linked repair reactions followed by liga- 

35 tion, or extension of the DNA strand with DNA 
polymerase and an appropriate single stranded 
template followed by ligation. 

It should, of course, be understood that the 
nucleotide sequences of the DNA fragment insert- 

40 ed at the selected site of the cloning vehicle may 
include nucleotides which are not part of the actual 
structural gene for the desired potypeptide/protein 
or may include only a fragment of the complete 
structural gene for the desired protein. It is only 

45 required that whatever DNA sequence is inserted, a 
transformed host will produce a protein/peptide 
having an immunological activity to the AIDS env 
protein or that the DNA sequence itself is of use as 
a hybridization probe to select clones which con- 
so tain DNA sequences useful in the production of 
polypeptides/proteins having an immunological ac- 
tivity to the AIDS env protein. 

The cloning vehicle or vector containing the 
foreign gene is employed to transform a host so as 

55 to permit that host to express the protein or portion 
thereof for which the hybrid DNA codes. The selec- 
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M - Methionine; N = Asparagine; P = Proline; 
Q = Glutamine; R = Arginine; S = Serine; 

T = Threonine; V = Valine; W = Tryptophan; 
Y = Tyrosine; Z = Glutamine or Glutamic Acid. 

In accordance with the present invention, the 
search for the envelope protein of the etiologic 
agent for acquired immune deficiency syndrome 
(AIDS) has led to the isolation and sequencing of 
the proviral gene of the AIDS virus. It has now 
been discovered, for what is believed to be the first 
time that the postulated etioiogic agents of AIDS, 
lymphadenopathy-associated virus (LAV). AIDS-as- 
sociated retrovirus (ARV) and human T-cell 
leukemia/Iymphoma/lymphotropic virus (HTLV III) 
are in fact variants of the same virus. For purposes 
of this invention, in the specification and claims the 
virus causing AIDS will be referred to herein as 
AIDS virus. AIDS virus will be understood to in- 
clude the variants which have been postulated as 
the causative agents of AiDS, namely LAV, ARV 
and HTLV III. The enveiope protein of the AIDS 
virus (env AIDS) is a 97,200 dalton protein with 32 
potential N-glycosylation sites. Nucleotide se- 
quence analysis of the AIDS envelope gene of the 
putative etiologic agents of AIDS demonstrates that 
all the viruses are variants of the same virus. That 
means that there is approximately 1 to 20% diver- 
gence or variation from the sequence of the en- 
velope gene of HTLV III and the sequences of the 
envelope genes of the other viruses LAV and ARV- 
2. The amino acid sequence of the env AIDS is set 
forth in Figure 6(a). The amino acid distribution is 
set forth in Figure 6(b). 

The nucleotide sequence of the enveiope gene 
is shown in Figure 1. The proviral DNA sequence, 
using methods known to one of ordinary skill in the 
art such as the chemical degradation method of 
Maxam and Gilbert of the M13 sequencing system 
of Messing which is a modification of the dideoxy 
nucleotide chain termination method of Sanger, 
was analyzed to determine the location of the re- 
gion coding for the enveiope protein. The location 
of an open reading frame, i.e. a long stretch of 
triplet codons not interrupted by a translational stop 
codon, for the envelope gene was determined. The 
open reading frame coding for the env gene is 863 
amino acids and contained an ATG codon at the 
eighth position from the 5' end of the reading 
frame. The ATG codon is known to be a universal 
translation-initiation codon. 

The integrated proviral genome of HTLV-III was 
cloned from the genomic DNA of H9 cells infected 
with HTLV-III [Shaw, G.M. et a!.. "Molecular char- 
acterization of Human T-cell leukemia 
(lymphotropic) virus type III in the acquired im- 
mune deficiency syndrome", Science 226, 1165- 



1171 (1984)]. Since the HTLV-III provrrus was 
found to lack XbaJ restriction sites, a genomic 
Jibrary was constructed by using Xbal digested 
H9/HTLV-III DNA. There are several methods avail- 
5 able to one of ordinary skill in the art for screening 
the bacterial ciones containing the AIDS env pro- 
tein cDNA. These include, for example. RNA selec- 
tion hybridization, differentiaJ hybridization with a 
synthetic probe or screening for clones that pro- 

10 duce the desired protein by immunological or bio- 
logical assays. From the genomic library, colonies 
of cells transformed with DNA that contajns the 
HTLV III sequences were selected by hybridization 
screening of the library with HTLV III cDNA The 

rs DNA insert of the hybndization-positive clone, 
HXB-3, excised from the plasmid DNA and 
sequence: 

The .:: dieted product of the env gene shares 
many fea _res in common with the envelope gene 

20 products ct other retroviruses. Thus, a hydrophobic 
region is seen in the middle of the protein (amino 
acids 519-534) which includes a processing site for 
the cleavage of the precursor protein into exterior 
and transmembrane proteins. Similarly, the amino 

25 terminaJ end contains a short stretch of hydropho- 
bic amino acids (amino adds 17-37) which con- 
stitutes a potential signal sequence. The HTLV-III 
envelope precursor differs from the other retroviral 
envelope protein precursors in that it contains an 

30 additionaJ stretch of 180 amino acids' at the car- 
boxy terminus. 

Polymorphism within the Envelope Region of AIDS 
Virus 

35 

The recent publication of the nucleotide se- 
quences of LAV, ARV-2 and HTLV-III [Ratner, L, et 
a!., supra; Sanchez-Pescadon, R., et al., supra; 
Wain-Hobson, S., et al.. supra] aJlows a detailed 

40 comparison of these various isolates obtained from 
AIDS patients from different parts of the world. 
HTLV-III clones were isolated from AIDS 'patient 
lymphocytes obtained from the east coast of the 
United States, while LAV was isolated from a 

45 French man and ARV was isolated from a patient in 
California. A comparison of the sequence data con- 
firms the earlier observations made using restne- 
tion enzyme site analysis which showed approxi- 
mately 10% variation. The present anaJysis shows 

so that the various isolates show the greatest amount 
of conservation in the gag and pol regions while 
the most divergence occurs in the anv region. A 
comparison of the five env sequences is presented 
in Figure 2. With respect to the envelope gene, 

55 HTLV-III and LAV are more closely related to each 
other than the ARV clone. Approximately 1.6% 
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Detailed Description of the Invention 

In the description the following terms are em- 
ployed: 

Nucleotide : A monomeric unit of DNA consist- 
ing of a sugar moiety (pentose), a phosphate, and 
either a purine or pyrimidine base (nitrogenous 
heterocyclic). The base is linked to the sugar moi- 
ety via the glycosidic carbon (V carbon of the 
pentose). That combination of a base and a sugar 
is called a nucleoside. Each nucleotide is char- 
acterized by its bctee. The four DNA bases are 
adenine ("A"), guanine ("G"), cytosine ("C") and 
thymine ("T"). 

DNA Sequence : A linear array of nucleotides 
connected one to the other by phosphodiester 
bonds between the 3' and 5' carbons of adjacent 
pentoses. 

Codon : A DNA- sequence of three nucleotides 
(a triplet) which encodes through mRNA an amino 
acid, a translation start signal or a translation ter- 
mination signal. For example, the nucleotide triplets 
TTA, TTG, CTT, CTC. CTA and CTG encode for 
the amino acid leucine ("Leu"). TAG, TAA and 
TGA are translation stop signals and ATG is a 
translation start signal. 

Reading Frame : The grouping of codons during 
translation of mRNA into amino acid sequences. 
During translation the proper reading frame must 
be maintained. For example, the sequence 
GCTGGTTGTAAG may be translated in three read- 
ing frames or phases, each of which affords a 
different amino acid sequence: 

GCT GGT TGT AAG = Ala-Gly-Cys-Lys 

G CTG GTT GTA AG ~ Leu-Val-Val 

TGG TTG TAA G = Trp-Leu-(STOP) 
Polypeptide: A linear array of amino acids con- 
nected one to the other by peptide bonds between 
the ar-amino and carboxy groups of adjacent amino 
acids. 

Genome : The entire DNA of a cell or a virus. It 
includes inter alia the structural genes coding for 
the polypeptides of the substance, as well as oper- 
ator, promoter and ribosome binding and inter- 
action sequences, including sequences such as the 
Shine-Dalgarno sequences. 

Structural Gene : A DNA sequence which en- 
codes through its template or messenger RNA 
("mRNA") a sequence of amino acids characteris- 
tic of a specific polypeptide. 

Transcription : The process of producing mRNA 
from a structural gene. 



Translation : The process of producing a poly- 
peptide from mRNA. 

Expression : The process undergone by a struc- 
tural gene to produce a polypeptide. It is a com- 

5 btnation of transcription and translation. 

Plasmid : A circular double-stranded DNA mol- 
ecule that is not a part of the main chromosome of 
an organism containing genes that convey resis- 
tance to specific antibiotics. When the plasmid is 

w placed within a unicellular organism, the character- 
istics of that organism may be changed or trans- 
formed as a result of the DNA of the plasmid. For 
example, a plasmid carrying the gene for 
tetracycline resistance (Tet R ) transforms a cell pre- 

75 viously sensitive to tetracycline into one which is 
resistant to it. A cell transformed by a plasmid is 
called a "transformant". 

Cloning Vehicle : A plasmid. phage DNA or 
other DNA sequences which are able to replicate in 

20 a host cell, which are characterized by one or a 
small number of endonuclease recognition sites at 
which such DNA sequences may be cut in a deter- 
minable fashion without attendant loss of an essen- 
tial biological function of the DNA, e.g., replication, 

25 production of coat proteins or loss of promoter or 
binding sites, and which contain a marker suitable 
for use in the identification of transformed cells, 
e.g., tetracycline resistance or ampicillin resistance. 
A cloning vehicle is often called a vector. 

30 Cloning : The process of obtaining a population 

of organisms or DNA sequences derived from one 
such organism or sequence by asexual reproduc- 
tion. 

Recombinant DNA Molecule or Hybrid DNA : A 

35 molecule consisting of segments of DNA from dif- 
ferent genomes which have been joined end-to-end 
outside of living cells and have the capacity to 
infect some host cell and be maintained therein. 
The nomenclature used to define the peptides 

40 or proteins is that used in accordance with conven- 
tional representation such that the ammo group at 
the N-terminus appears to the left and the carboxy! 
group at the C-terminus to the right, By natural 
amino acid is meant one of the amino acids com- 

45 monly occurring in natural proteins comprising Gly, 
Ala, Vat, Leu, lie, Ser, Thr, Lys, Arg, Asp. Asn, Glu, 
Gin. Cys, Met, Phe, Tyr, Pro, Trp and His. By Nle 
is meant norleucine, and by Nva is meant nor- 
valine. Where L and D forms are possible, it is the 

so L-form of the amino acid that is represented unless 
otherwise expressly indicated. In addition, amino 
acids have been designated by specific letters of 
the alphabet such that: A = Alanine; B = Aspartic 
Acid or Asparagine; C = Cysteine; D = Aspartic Acid; 

55 E = Glutamic Acid; F = Phenylalanine; G = Glycine; 
H = Histidine; I = Isoleucine; K = Lysine; L= Leucine; 
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lymphadenopathy-associated virus (LAV), AIDS-as- 
sociated retrovirus (ARV) and the human T-cell 
leukemia^ymphoma/lymphotropic virus type III - 
(HTLV III). 

A further aspect of this invention reiates to a 5 
diagnostic method for testing human blood for the 
presence of antibodies to the env AIDS protein. 
This aspect of the invention overcomes the prob- 
lems of all previously used blood tests for AIDS. 
One of the problems is the use of compositions to to 
bind AIDS antibody which contain proteins or pep- 
tides which were not derived solely from the AIDS 
etiologic agent. A composition using homogeneous 
envelope AIDS protein of this invention overcomes 
the nonspecificity of the prior tests or assays. Yet is 
another aspect of this invention is a diagnostic 
method for detecting and/or determining the pres- 
ence of the antigen in human blood. 

Another aspect of this invention is to use the 
env AIDS proteins of the instant invention as anti- 20 
gens suitable for providing protective immunity 
against AIDS when incorporated into a vaccine. 

Brief Description of the Drawings 

25 

Fig. 1. The nucleotide sequence of the en- 
velope gene of the HTLV-III proviral genome 
(HXB-3). 

30 

Fig. 2. Comparison of the amino acid se- 
quence of the env protein of the five pur- 
ported etiologic agents of AIDS. Amino acid 
sequences are aJigned to give maximum ho- 
mology. ?5 

Fig. 3. Construction of the pEV/env44-640 
expression plasmids. The upper left panel 
shows a simplified restriction site map of the 
3.15 Kb EcoRI-Xhol segment of the HTLV-III 40 
genome which contains the env coding re- 
gion (cross-hatched arrow). The right panel 
shows the structure and pertinent sequences 
of the pEV-vrf plasmids. The solid black 
region .epresents the synthetic ribosome 45 
binding site sequences upstream of the ATG 
initiation codon (overlined). See Example 2 
for a detailed description of the env expres- 
sion plasmid constructions. 

50 

Fig. 4. Western blot analysis of env coded 
antigens produced in E. coli. TotaJ bactenal 
proteins were resolved by SDS-PAGE. 
eiectro-b lotted onto a nitrocellulose filter, and 
env encoded proteins were detected by 55 
reacting with human sera as described m 



Example 5: a) negative control, cells contain- 
ing DJCL-E30 (p2lT) induced at 42° C for 2 
hours; b) unmduced control, cells containing 
pEV3/env44-640 maintained at 30 *C; c) 
pEV3/env44-640; d) pEVl'env44-640; and e) 
pEV3/env205-640 induced at 42°C for 2 
hours. 

Fig. 5. Recognition of bacterially synthesized 
HTLV-III env gene products by antibodies in 
AIDS patient sera. Bacteria! lysates contain- 
ing recombinant env proteins were subjected 
to Western blot analysis as described in 
Example 5. Individual strips were then in- 
cubated with a 1000-fold dilution of individual 
sera followed by treatment with ,2 *l-tebeied 
protein A, (upper part) Serum samples were 
from the following donors: (lane 1) normaJ 
healthy donor; (lanes 2-18) AIDS patient sera 
collected from the West Coast of the USA. - 
(Lower part) Serum samples were taken from 
the following donors: {lane 1 ) donor found to 
be HTLV-1{ + } by Elisa using disrupted 
virus; (lanes 4, 5, 11 and 15) healthy, normal 
donors; (lanes 2, 3. 6, 8, 10, 12, 13, 14, 16, 
17 and 18) AIDS patient sera from the East 
Coast of the USA. 

Fig. 6A. The amino acid sequence of the 
AIDS envelope protein. 

Fig. 6B. The amino acid distribution of the 
AJDS envelope protein. 

Fig. 7. Construction of the expression vector 
pRC23. The Shine-DaJgarno sequence (SD) 
is overlined and the location of the synthetic 
ribosome binding site sequence in the plas- 
mid is represented by the solid black seg- 
ment. The plasmid contains the entire se- 
quence of pBR322 and thus confers resis- 
tance to both ampicillin (amp R ) and 
tetracycline (tet n ). 

Fig. 8. Construction of the pEV-vrf vectors. 
The synthetic oligonucleotides for each plas- 
mid which were placed downstream of the 
SD sequence in pRC23 are shown with the 
locations of the restriction enzyme cleavage 
sites. The ATG initiation codon is overlined, 
and the placement of the additional A-T base 
pairs is designated by the rectangle. The 
plasmids confer resistance to ampiciKin only. 
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cytopathic characteristic of HTLV ill was critical to 
determining ultimately the specific retroviral origin 
of the disease. Thus the etiologic agent of AIDS 
was isolated by use of immortalized human neo- 
plastic T cell lines (HT) infected with the cytopathic 
retrovirus characteristic of AIDS, isolated from AIDS 
afflicted patients. Seroepidemiological assays using 
this virus showed a complete correlation between 
AIDS and the presence of antibodies to HTLV III 
antigens [Sarngadharan, M.G. et al., supra; 
Schupbach, J. et al., supra]. In addition, nearly 
85% of Datients with lymphadenopathy syndrome 
and a significant proportion of asymptomatic ho- 
mosexual men in AIDS endemic areas were also 
found to carry circulating antibodies to HTLV III. 
Taken together, ail these data indicate HTLV Ml to 
be the etiologic agent for AIDS. 

Until the successful culturing of AIDS virus 
using H-9 cell line [PCT application, publication no. 
WO 85/048971 the env AIDS protein of the AIDS 
virus had not been isolated, characterized or syn- 
thesized. This in major part is due to the fact that 
the virus is cytopathic and thus isolation of the 
virus was not possible [Popovic, M. et al., supra]. 
Once the human T-cell line resistant to the 
cytopathic effects of the virus was discovered, a 
molecular clone of proviral DNA could be achieved. 

The need for a sensitive and rapid method for 
the diagnosis of AIDS in human blood and its 
prevention by vaccination is very great. Virtually ail 
the assays/tests presently available are fraught with 
errors. In fact the Center for Disease Control - 
(CDC) has indicated that presently available tests 
be used solely for screening units of blood for 
antibody to HTLV III. The CDC went further by 
stating that the presently available ELISA tests can 
not be used for general screening of high risk 
pupuiations or as a diagnostic test for AIDS - 
[Federal Register 50(48), 9909, March 12, 1985]. 
The errors have been traced to the failure to use a 
specific antigenic protein of the etiologic agent for 
AIDS. The previously used proteins were derived 
from a viral lysate. Since the lysate is made from 
human cells infected with the virus, i.e. the cells 
used to grow the virus, the lysate will contain 
human proteins as well as viral proteins. Thus 
preparation of a pure antigen of viral protein is very 
difficult. The antigen used produced both false 
positive and false negative results [Budiansky, S., 
"AIDS Screening, False Test Results Raise 
Doubts", Nature 312. 583(1984)]. The errors caus- 
ed by the use of such lysate proteins/peptides can 
be avoided by using a composi tion for binding 



AIDS antibodies which is substantially free of the 
non-AIDS specific proteins. Compositions that are 
substantially pure AIDS envelope protein can be 
used as antigens. 
5 The AIDS envelope protein of the instant inven- 

tion has been established to have conserved epi- 
topes which permit its use to screen for, diagnose 
and/or prevent by vaccination the infection by AIDS 
virus. The instant invention demonstrates that the 

w envelope protein with its conserved epitopes in- 
cludes all the variants which have been claimed as 
the sole etiologic agent. 

The envelope AIDS protein of the present in- 
vention may be produced by conventionally known 

J5 methods. The processes by which the novel pro- 
tein may be produced can be divided into three 
groups: (1) chemical synthesis; (2) preparation of a 
gene prepared by chemical synthesis which is in- 
serted into a host and'a protein is produced by the 

20 host; and (3) a corresponding gene obtained 
biotechnically is inserted into a host and a protein 
is produced by the host. 

In one embodiment of this invention, recom- 
binant DNA techniques are utilized by which env 

25 AIDS DNA from a natural source is introduced into 
a cell to produce the env AIDS protein. One meth- 
od of obtaining DNA which encodes env AIDS is to 
read the genetic, code in reverse and synthesize an 
oligodideoxynucleotide which should encode the 

30 env AIDS amino acid sequence. As the env protein 
has not been isolated or characterized this ap- 
proach cannot be pursued. 

Alternatively gene expression can be obtained 
using recombinant DNA technology if DNA isolated 

35 from natural sources is used instead of synthetic 
DNA. 

Summary of the Invention 

40 This invention is directed to the engineering of 

HTLV III env gene into suitable expression vectors; 
transformation of host organisms with such expres- 
sion vectors; and production of envelope AIDS pro- 
tein (env AIDS) by culture of such transformed 

45 cells. Another aspect of the present invention re- 
lates to the isolation and use of the resulting re- 
combinant env AIDS protein. 

Another aspect of the present invention is the 
identification and determination of the proviral DNA 

so sequence. More specifically, this aspect of the in- 
vention relates to determination and comparison of 
the proviral nucleotide sequence of the envelope 
genes of the purported etiologic agent of AIDS i e. 

55 
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Recombinant acquired immune deficiency syndrome (AIDS) viral envelope protein and method of testing for AJDS 



This invention relates to a protein, designated 
env AIDS, which is the viral envelope protein of the 
etiologic agent for acquired immune deficiency 
syndrome, tne expression vector which encodes 
env AIDS, the production of env AIDS using recom- 
binant DNA technology and a method for detecting 
the presence of AIDS antibodies in human blood. 

Background of the Invention 

From 1981 to date, there have been more than 
eight thousand (8,000) people diagnosed as having 
acquired immune deficiency syndrome (AIDS) - 
[N.Y. Times. A-1 1 January 11. 1985]. AIDS has 
been characterized by the onset of severe op- 
portunistic infections secondary to an effect on the 
body's immune system [Gottlieb, M.S. et al., 
"Pneumocystis Carinii Pneumonia and MucosaJ 
Candidiasis in previously healthy homosexual men: 
evidence of a new acquired cellular im- 
munodeficiency", N. Eng. J. Med. 305, 1426-1431 - 
(1981)]. The disease has been found in male 
homosexuals, patients receiving blood products, in- 
travenous drug addicts, and individuals originating 
from Haiti and Central Africa [Piot, P. et al., 
"Acquired immunodeficiency syndrome in a het- 
erosexual population in Zaire", Lancet 11, 65-69 - 
(1984)]. The causative agent was suspected to be 
of viral origin as the epidemiological pattern of 
AIDS was consistent with a transmissable disease. 
At least three (3) retroviruses have been isolated 
from cultured T-cel!s of several patients with AIDS, 
or from white blood cells of persons at risk for the 
disease. A novel human retrovirus called 
lymphadenopathy-associated virus (LAV) was dis- 
covered and its properties were consistent with its 
etiological role in AIDS. That virus was isolated 
from a patient with lymph adenopathy and hence 
Lhe name [Montagnier. L et ai.. "A New Human T- 
lymphotropic retrovirus: characterization and possi- 
ble role in lymphadenopathy and acquired immune 
deficiency syndromes. In Human T-Cell 
Leukemia/Lymphoma Virus, R.C. GaJlo, M. Essex 
and L Gross, eds. (Cold Spring Harbor, N.Y : Cold 
Spring Harbor Laboratory) pp. 363-370]. Other hu- 
man retroviruses, specifically two subgroups of the 
human T-cell leukemia/lymphoma/lymphotropic 
virus, types I and HI have been isolated [HTLV I: 
Poiesz. B.J. et al., "Detection and isolation of type 
C retrovirus particles from fresh and cultured lym- 
phocytes of a patient with cutaneous T-cell lym- 
phoma", PNAS (USA) 77, 7415-7419 (1980); HTLV- 
III: Popovic, M. et al.. "Detection, isolation and 
continuous production of cytopathic retroviruses - 



(HTLV-III) from patients with AIDS and pre-AlDS", 
Science 224, 497-500 (1984)]. Still another virus,' 
the AJDS-associated retrovirus (ARV), was pro- 
posed as the causative agent [Levy, JA_ et aJ., 
5 "Isolation of lymphocytopathic retroviruses from 
San Francisco patients with AIDS". Science 225, 
840-842 (1984)]. Both the HTLV-III and ARV retro 
viruses display biologicaJ and sero-epidemrologicaJ 
properties similar to LAV [Levy JA et al., supra, 
iQ Popovic, M. et al., supra]. As seen from the above, 
at least three (3) retroviruses have been postulated 
as the etiologic agent of AIDS: LAV; ARV; and, 
HTLV subtypes I and III. 

LAV, HTLV Ell and ARV-II genomes have been 
is molecularly cloned [Schupbach, J. et al., 
"Serological analysis of a subgroup of human T- 
lymphotropic retroviruses (HTLV-III) associated with 
AIDS", Science 224, 503-505 (1984); Alizon, M. et 
al., "Molecular Cloning of lymphadenopathy -asso- 
20 dated virus". Nature 312, 757-760 (1984)]. The 
complete nucleotide sequence of the proviral 
genome of LAV, ARV and HTLV III has been deter- 
mined [Ratner, L et al., "Complete nucleotide se- 
quence of the AIDS virus. HTLV III", Naure 313, 
25 277-284 (1985); Sanchez- Pescador. R. et al-, 
"Nucleotide sequence and expression of an AJDS- 
associated retrovirus (ARV-2)", Science 227, 484- 
492 (1985); Wain-Hobson, S. et al., "Nucleotide 
sequence of the AJDS virus, LAV", Cell 40. 9-17 - 
30 (1985)]. 

One reason for the difficulty in determining the 
etiologic agent of AIDS was due to the reactivity of 
various retroviral antigens with serum samples from 
AIDS patients. For example, serum samples from 
35 AIDS patents have been shown to react with anti- 
gens of HTLV I and HTLV ill [HTLV-I: Essex, M. et 
al.. "Antibodies to Cell Membrane Antigens Asso- 
ciated with Human T-Cell Leukemia Virus in Pa- 
tients with AIDS". Science 220, 859-862 (1983); 
40 HTLV-III:" Sarngadharan. M.G. et al., "Antibodies 
Reactive With Human T-Lymphotropic Retroviruses 
(HTLV-tll) in the Serum of Patients Wrth Aids", 
Science 224. 506-508 (1984)]. Envelope gene pro- 
ducts of HTLV demonstrated antigenicities cross- 
es reactive with antibodies in sera from adult T-cell 
leukemia patients [Kiyokawa, T. et al.. "Envelope 
proteins of human T-cell leukemia virus: Expres- 
sion in Escherichia coir and its application to stud- 
ies of env gene functions". PNAS (USA) 81, 6202- 
50 6206 (1984)]. Adult T-cell leukem;as (ATL) differ 
from acquired immune deficiencv syndrome - 
(AJDS) in that HTLV-I causes T-cell ~aiignancies. 
that is uncontrolled growth of T-ceN -n AiCS rather 
than cell growth there is cell ceatn in 'act this 
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Recombinant acquired immune deficiency syndrome (AIDS) viral envelope protein and method of 
CD testing for AIDS. 
O) 

q@ Recombinant envelope proteins of the etiologic 

agent of acquired immune deficiency syndrome - 
"(AIDS) useful in diagnosis and therapy of AIDS and a 

method for their preparation are described. Proviral 

DNA is transferred into a host cell after engineering 



into an expression vector which produces :ne en- 
velope protein. A method of testing human dtood for 
the presence of antibodies to the AIDS virus using 
the AIDS envelope protein is also described 
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